(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
2 October 2003 (02.10.2003) 




PCT 



Illl.ll llllll lll l lll llllllllll lll 

(10) International Publication Number 

WO 03/080856 A2 



(51) International Patent Classification 7 : C12Q 

(21) International Application Number: PCT/US03/08490 

(22) International Filing Date: 19 March 2003 (19.03.2003) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

60/365,491 
60/410,618 



19 March 2002 (19.03.2002) US 
13 September 2002 (13.09.2002) US 



(71) Applicant (for all designated Slates except US): CURA- 
GEN CORPORATION [US/US]; 555 Long Wharf Drive, 
11th floor, New Haven, CT 06511 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): LEPLEY, Denise, 
M. [US/US]; 51 Church Street, Branford, CT 06405 (US). 
RIEGER, Daniel, K. [DE/US]; 10A McKinnel Court, 
Branford, CT 06405 (US). TSE, Kam-Fai [CN/US]; 7 
Hurd Bridge Road, Apartment 1, Clinton, CT 06413 (US). 
RASTELLI, Luca [TITUS]; 52 Pepperbush Lane, Guil- 
ford, CT 06437 (US). SMITHSON, Glennda [US/US]; 
125 Michael Drive, Guilford, CT 06435 (US). MESRI, 
Mehdi [IR/US]; 153 Monticello Drive, Brandford, CT 
06405 (US). OOI, Chean, Eng [US/US]; 14 Flax Mill 
Hollow, Branford, CT 06405 (US). ANDERSON, David, 
W. [US/US]; 85 Montoya Drive, Branford, CT 06405 



(US). GUO, Xiaojia [CN/US]; 713 Robert Frost Drive, 
Branford, CT 06405 (US). GIOT, Loic [FR/US]; 99 
Country Way, Madison, CT 06443 (US). STARLING, 
Gary [NZ/US]; 45 Robin Court, Middletown, CT 06457 
(US). 

(74) Agent: ELRIFI, Ivor, R.; Mintz, Levin, Cohn, Ferris, 
Glovsky, and Popeo, P., C, One Financial Center, Boston, 
MA 02111 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SC, SD, SE, 
SG, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARTPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, 
SE, SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

[Continued on next page] 



= (54) Title: THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, AND METHODS OF USE 



= PATHWAY 2968812.CG57008 HAVCR-1" 



< 

v© 
IT) 
00 

00 

o 
O 



CG51 373-10 




LOC1 55465 



(57) Abstract: Disclosed herein are nucleic 
acid sequences that encode novel polypeptides 
encoded by these nucleic acid sequences, 
and antibodies that immunospecifically bind 
to the polypeptide, as well as derivatives, 
variants, mutants, or fragments of the novel 
polypeptide, polynucleotide, or antibody 
specific to the polypeptide. Vectors, 
host cells, antibodies and recombinant 
methods for producing the polypeptides 
and polynucleotides, as well as methods for 
using same are also included. The invention 
further discloses therapeutic, diagnostic and 
research methods for diagnosis, treatment, and 
prevention of disorders involving any one of 
these novel human nucleic acids and proteins. 
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THERAPEUTIC POLYPEPTIDES, NUCLEIC ACIDS ENCODING 
SAME, AND METHODS OF USE 



FIELD OF THE INVENTION 

The present invention relates to novel polypeptides, and the nucleic acids encoding 
them, having properties related to stimulation of biochemical or physiological responses in 
a cell, a tissue, an organ or an organism. More particularly, the novel polypeptides are 
gene products of novel genes, or are specified biologically active fragments or derivatives 
thereof. Methods of use encompass diagnostic and prognostic assay procedures as well as 
methods of treating diverse pathological conditions. 
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BACKGROUND OF THE INVENTION 

Kidney Cancer 

Renal cell carcinoma (RCC or kidney cancer) is the most common primary cancer 
that occurs in the kidney and accounts for more than 85% of all primary renal neoplasms 

5 in adults. Transitional cell carcinomas of the renal pelvis are the next most common and 
account for approximately 8% of all primary renal neoplasms. Malignant renal neoplasms 
can be primary or metastatic tumors. Renal cell carcinoma usually occurs in adults, 
between the ages of 40 and 60 years, with the incidence in men being double of that in 
women. Renal cell carcinoma represents 2% to 3% of all cancers and 2% of all cancer 

10 deaths. In 2002, renal cell cancer was diagnosed in approximately 3 1,000 individuals in 
the United States, almost 40% of whom will eventually die of the disease. 

Renal cell carcinoma occurs predominantly at the age of 60 to 90, with a 1 .27% 
lifetime risk of developing renal cell carcinoma for a 40-year-old man and 0.51% risk of 
death from this disease. The incidence of renal cell cancer increased steadily from 1975 to 

1 5 1 995 in both females (3.1% in white women; 4.3% in black women) and males (2.3% in 
white men; 3.9% in black men), as did the mortality rates. Since 1950, there has been a 
. 126% increase in the incidence of RCC, accompanied by a 37% increase in annual 
mortality caused by RCC. 

Kidney cancer often goes undiagnosed or is mis-diagnosed until it has spread 

20 (metastasized). As a result, 1 5-25% of kidney cancer patients have metastatic disease at 
time of diagnosis. One-third of patients with renal cell carcinoma have distant metastases 
at the time fee primary tumor is diagnosed. Surgery has been the primary therapy for renal 
cell carcinoma for more than 100 years. Historically, kidney cancer has proven to be 
resistant to many chemotherapy agents, and response rates for traditional chemotherapy 

25 used for kidney cancer are often below 10%. Satisfactory treatment of renal cancer is an 
unmet medical need, as existing therapeutics have not been successful in curtailing the 
disease and increasing mortality. Consequently, a therapeutic that can successfully treat 
renal cancer has the beneficial effects of decreasing morbidity and mortality, while 
potentially saving the healthcare system millions of dollars in costs associated with 

30 invasive surgical procedures, immuno- and chemotherapy, and ancillary support services. 
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Inflammation 

Asthma is a chronic inflammatory disorder of the airways in which many cells, 
including mast cells, eosinophils, and T lymphocytes, play a role. In affected individuals 
this inflammation results in recurrent episodes of wheezing, coughing, breathlessness, and 

5 chest tightness. These symptoms are associated with variable airflow limitation that can be 
reversed either spontaneously or with treatment. The Centers for Disease Control have 
shown that the prevalence of asthma has increased approximately 42% in the United States 
from 1982 to 1992, from 34.7 to 49.4 cases per 1000. For ages 5-34 years, the rate 
increased 52% from 34.6 to 52.6 cases per 1000. During this same period, hospitalization 

10 rates and the annual death rate from asthma also increased. From 1980 to 1993, asthma 
accounted for 3850 deaths among persons aged 0 to 24 years, with the annual age-specific 
asthma death rate increasing 118%. In this same period, the annual hospitalization rate for 
asthma among persons aged 0 to 24 years increased 28%. 

Changes in the risk factors thought to cause and worsen asthma are responsible for 

1 5 much of this recent increase in asthma prevalence. Children and adults with severe asthma 
symptoms have lower lung function than those with less severe symptoms. Satisfactory 
treatment of asthma is an unmet medical need, as existing therapeutics have not been 
successful in curtailing the incidence or the severity of the disease. Consequently, a 
therapeutic that can successfully treat asthma has the beneficial effects of decreasing 

20 morbidity. 

Eukaryotic cells are characterized by biochemical and physiological processes 
which under normal conditions are exquisitely balanced to achieve the preservation and 
propagation of the cells. When such cells are components of multicellular organisms such 
as vertebrates, or more particularly organisms such as mammals, the regulation of the 
25 biochemical and physiological processes involves intricate signaling pathways. 

Frequently, such signaling pathways involve extracellular signaling proteins, cellular 
receptors that bind the signaling proteins, and signal transducing components located 
within the cells. 

Signaling proteins can be classified as endocrine effectors, paracrine effectors or 
30 autocrine effectors. Endocrine effectors are signaling molecules secreted by a given organ 
into the circulatory system, which are then transported to a distant target organ or tissue. 
The target cells include the receptors for the endocrine effector, and when the endocrine 



WO 03/080856 



PCT/US03/08490 



effector binds, a signaling cascade is induced. Paracrine effectors involve secreting cells 
and receptor cells in close proximity to each other, for example two different classes of 
cells in the same tissue or organ. One class of cells secretes the paracrine effector, which 
then reaches the second class of cells, for example by diffusion through the extracellular 

5 fluid. The second class of cells contains the receptors for the paracrine effector; binding 
of the effector results in induction of the signaling cascade that elicits the corresponding 
biochemical or physiological effect Autocrine effectors are highly analogous to paracrine 
effectors, except that the same cell type that secretes the autocrine effector also contains 
the receptor. Thus the autocrine effector binds to receptors on the same cell, or on 

10 identical neighboring cells. The binding process then elicits the characteristic biochemical 
or physiological effect. 

Signaling processes can elicit a variety of effects on cells and tissues including by 
way of nonlimiting example induction of cell or tissue proliferation, suppression of growth 
or proliferation, induction of differentiation or maturation of a cell or tissue, and 

1 5 suppression of differentiation or maturation of a cell or tissue. 

Many pathological conditions involve dysregulation of expression of important 
effector proteins. In certain classes of pathologies the dysregulation is manifested as 
diminished or suppressed level of synthesis and secretion of protein effectors. In other 
classes of pathologies the dysregulation is manifested as increased or up-regulated level of 

20 synthesis and secretion of protein effectors. In a clinical setting a subject may be 

suspected of suffering from a condition brought on by altered or mis-regulated levels of a 
protein effector of interest Therefore there is a need to assay for the level of the protein 
effector of interest in a biological sample from such a subject, and to compare the level 
with that characteristic of a nonpathological condition. There also is a need to provide the 

25 protein effector as a product of manufacture. Administration of the effector to a subject in 
need thereof is useful in treatment of the pathological condition. Accordingly, there is a 
need for a method of treatment of a pathological condition brought on by a diminished or 
suppressed levels of the protein effector of interest In addition, there is a need for a 
method of treatment of a pathological condition brought on by a increased or up-regulated 

30 levels of the protein effector of interest 

Therefore there is a need to assay for the level of a protein effector of interest in a 
biological sample from such a subject, and to compare this level with that characteristic of 
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a nonpathological condition. In particular, there is a need for such an assay based on the 
use of an antibody that binds immunospecifically to the antigen. There further is a need to 
inhibit the activity of the protein effector in cases where a pathological condition arises 
from elevated or excessive levels of the effector based on the use of an antibody that binds 
5 immunospecifically to the effector. Thus, there is a need for the antibody as a product of 
manufacture. There further is a need for a method of treatment of a pathological condition 
brought on by an elevated or excessive level of the protein effector of interest based on 
administering the antibody to the subject. 

SUMMARY OF THE INVENTION 

10 The invention is based in part upon the discovery of isolated polypeptides 

including amino acid sequences selected from mature forms of the amino acid sequences 
selected from the group consisting of SEQ ID NO:2n, wherein n is an integer between 1 
and 12. The novel nucleic acids and polypeptides are referred to herein as NOVX, or 
NOV1, NOV2, NOV3, e/c, nucleic acids and polypeptides. These nucleic acids and 

15 polypeptides, as well as derivatives, homologs, analogs and fragments thereof, are 
hereinafter collectively designated as "NOVX" nucleic acid or polypeptide sequences. 

The invention also is based in part upon variants of a mature form of the amino 
acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n is an 
integer between 1 and 12, wherein any amino acid in the mature form is changed to a 

20 different amino acid, provided that no more than 15% of the amino acid residues in the 
sequence of the mature form are so changed. In another embodiment, the invention 
includes the amino acid sequences selected from the group consisting of SEQ ID NO:2n, 
wherein n is an integer between 1 and 12. In another embodiment, the invention also 
comprises variants of the amino acid sequence selected from the group consisting of SEQ 

25 ID NO:2n, wherein n is an integer between 1 and 12 wherein any amino acid specified in 
the chosen sequence is changed to a different amino acid, provided that no more than 1 5% 
of the amino acid residues in the sequence are so changed. The invention also involves 
fragments of any of the mature forms of the amino acid sequences selected from the 
group consisting of SEQ ID NO:2n, wherein n is an integer between 1 and 12, or any other 
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amino acid sequence selected from this group. The invention also comprises fragments 
from these groups in which up to 15% of the residues are changed. 

In another embodiment, the invention encompasses polypeptides that are naturally 
occurring allelic variants of the sequence selected from the group consisting of SEQ ID 

5 NO:2n, wherein n is an integer between 1 and 1 2. These allelic variants include amino 
acid sequences that are the translations of nucleic acid sequences differing by a single 
nucleotide from nucleic acid sequences selected from the group consisting of SEQ ID 
NOS: 2n-l , wherein n is an integer between 1 and 12. The variant polypeptide where any 
amino acid changed in the chosen sequence is changed to provide a conservative 

10 substitution. 

In another embodiment, the invention comprises a pharmaceutical composition 
involving a polypeptide with an amino acid sequence selected from the group consisting of 
SEQ ID NO:2n, wherein n is an integer between 1 and 12 and a pharmaceutical^ 
acceptable carrier. In another embodiment, the invention involves a kit, including, in one 

15 or more containers, this pharmaceutical composition. 

In another embodiment, the invention includes the use of a therapeutic in the 
manufacture of a medicament for treating a syndrome associated with a human disease, 
the disease being selected from a pathology associated with a polypeptide with an amino 
acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n is an 

20 integer between 1 and 12 wherein said therapeutic is the polypeptide selected from this 
group. 

In another embodiment, the invention comprises a method for determining the 
presence or amount of a polypeptide with an amino acid sequence selected from the group 
consisting of SEQ ID NO:2n, wherein n is an integer between 1 and 12 in a sample, the 

25 method involving providing the sample; introducing the sample to an antibody that binds 
immunospecifically to the polypeptide; and determining the presence or amount of 
antibody bound to the polypeptide, thereby determining the presence or amount of 
polypeptide in the sample. 

In another embodiment, the invention includes a method for determining the 

30 presence of or predisposition to a disease associated with altered levels of a polypeptide 
with an amino acid sequence selected from the group consisting of SEQ ID NO:2n, 
wherein n is an integer between 1 and 12 in a first mammalian subject, the method 
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involving measuring the level of expression of the polypeptide in a sample from the first 
mammalian subject; and comparing the amount of the polypeptide in this sample to the 
amount of the polypeptide present in a control sample from a second mammalian subject 
known not to have, or not to be predisposed to, the disease, wherein an alteration in the 

5 expression level of the polypeptide in the first subject as compared to the control sample 
indicates the presence of or predisposition to the disease. 

In another embodiment, the invention involves a method of identifying an agent 
that binds to a polypeptide with an amino acid sequence selected from the group 
consisting of SEQ ID NO:2n, wherein n is an integer between 1 and 12, the method 

10 including introducing the polypeptide to the agent; and determining whether the agent 
binds to the polypeptide. The agent could be a cellular receptor or a downstream effector. 

In another embodiment, the invention involves a method for identifying a potential 
therapeutic agent for use in treatment of a pathology, wherein the pathology is related to 
aberrant expression or aberrant physiological interactions of a polypeptide with an amino 

1 5 acid sequence selected from the group consisting of SEQ ED NO:2n, wherein n is an 
integer between 1 and 12, the method including providing a cell expressing the 
polypeptide of the invention and having a property or function ascribable to the 
polypeptide; contacting the cell with a composition comprising a candidate substance; and 
determining whether the substance alters the property or function ascribable to the 

20 polypeptide; whereby, if an alteration observed in the presence of the substance is not 
observed when the cell is contacted with a composition devoid of the substance, the 
substance is identified as a potential therapeutic agent. 

In another embodiment, the invention involves a method for screening for a 
modulator of activity or of latency or predisposition to a pathology associated with a 

25 polypeptide having an amino acid sequence selected from the group consisting of SEQ ED 
NO:2n, wherein n is an integer between 1 and 12, the method including administering a 
test compound to a test animal at increased risk for a pathology associated with the 
polypeptide of the invention, wherein the test animal recombinantly expresses the 
polypeptide of the invention; measuring the activity of the polypeptide in the test animal 

30 after administering the test compound; and comparing the activity of the protein in the test 
animal with the activity of the polypeptide in a control animal not administered the 
polypeptide, wherein a change in the activity of the polypeptide in the test animal relative 



WO 03/080856 PCT/US03/08490 



to the control animal indicates the test compound is a modulator of latency of, or 
predisposition to, a pathology associated with the polypeptide of the invention. The 
recombinant test animal could express a test protein transgene or express the transgene 
under the control of a promoter at an increased level relative to a wild-type test animal The 
5 promoter may or may not be the native gene promoter of the transgene. 

In another embodiment, the invention involves a method for modulating the 
activity of a polypeptide with an amino acid sequence selected from the group consisting 
of SEQ ID NO:2n, wherein n is an integer between 1 and 12, the method including 
introducing a cell sample expressing the polypeptide with a compound that binds to the 

10 polypeptide in an amount sufficient to modulate the activity of the polypeptide. 

In another embodiment, the invention involves a method of treating or preventing a 
pathology associated with a polypeptide with an amino acid sequence selected from the 
group consisting of SEQ ID NO:2n, wherein n is an integer between 1 and 12, the method 
including administering the polypeptide to a subject in which such treatment or prevention 

1 5 is desired in an amount sufficient to treat or prevent the pathology in the subject. The 
subject could be human. 

In another embodiment, the invention involves a method of treating a pathological 
state in a mammal, the method including administering to the mammal a polypeptide in an 
amount that is sufficient to alleviate the pathological state, wherein the polypeptide is a 

20 polypeptide having an amino acid sequence at least 95% identical to a polypeptide having 
the amino acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n 
is an integer between 1 and 12 or a biologically active fragment thereof. 

In another embodiment, the invention involves an isolated nucleic acid molecule 
comprising a nucleic acid sequence encoding a polypeptide having an amino acid 

25 sequence selected from the group consisting of a mature form of the amino acid sequence 
given SEQ ID NO:2n, wherein n is an integer between 1 and 12; a variant of a mature 
form of the amino acid sequence selected from the group consisting of SEQ ID NO:2n, 
wherein n is an integer between 1 and 12 wherein any amino acid in the mature form of 
the chosen sequence is changed to a different amino acid, provided that no more than 15% 

30 of the amino acid residues in the sequence of the mature form are so changed; the amino 

acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n is an 

integer between 1 and 12; a variant of the amino acid sequence selected from the group 

8 
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consisting of SEQ ID NO:2n, wherein n is an integer between 1 and 12, in which any 
amino acid specified in the chosen sequence is changed to a different amino acid, provided 
that no more than 1 5% of the amino acid residues in the sequence are so changed; a 
nucleic acid fragment encoding at least a portion of a polypeptide comprising the amino 
5 acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n is an 

integer between 1 and 12 or any variant of the polypeptide wherein any amino acid of the 
chosen sequence is changed to a different amino acid, provided that no more than 10% of 
the amino acid residues in the sequence are so changed; and the complement of any of the 
nucleic acid molecules. 

10 In another embodiment, the invention comprises an isolated nucleic acid molecule 

having a nucleic acid sequence encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a mature form of the amino acid sequence 
given SEQ ID NO:2n, wherein n is an integer between 1 and 12, wherein the nucleic acid 
molecule comprises the nucleotide sequence of a naturally occurring allelic nucleic acid 

15 variant 

In another embodiment, the invention involves an isolated nucleic acid molecule 
including a nucleic acid sequence encoding a polypeptide having an amino acid sequence 
selected from the group consisting of a mature form of the amino acid sequence given 
SEQ ID NO:2n, wherein n is an integer between 1 and 12 that encodes a variant 

20 polypeptide, wherein the variant polypeptide has the polypeptide sequence of a naturally 
occurring polypeptide variant. 

In another embodiment, the invention comprises an isolated nucleic acid molecule 
having a nucleic acid sequence encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a mature form of the amino acid sequence 

25 given SEQ ID NO:2n, wherein n is an integer between 1 and 12, wherein the nucleic acid 
molecule differs by a single nucleotide from a nucleic acid sequence selected from the 
group consisting of SEQ ID NOS: 2n-l , wherein n is an integer between 1 and 1 2. 

In another embodiment, the invention includes an isolated nucleic acid molecule 
having a nucleic acid sequence encoding a polypeptide including an amino acid sequence 

30 selected from the group consisting of a mature form of the amino acid sequence given 
SEQ ID NO:2n, wherein n is an integer between 1 and 12, wherein the nucleic acid 
molecule comprises a nucleotide sequence selected from the group consisting of the 
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nucleotide sequence selected from the group consisting of SEQ ID NO:2n-l, wherein n is 
an integer between 1 and 12; a nucleotide sequence wherein one or more nucleotides in the 
nucleotide sequence selected from the group consisting of SEQ ID NO:2n-l, wherein n is 
an integer between 1 and 12 is changed from that selected from the group consisting of the 

5 chosen sequence to a different nucleotide provided that no more than 1 5% of the 
nucleotides are so changed; a nucleic acid fragment of the sequence selected from the 
group consisting of SEQ ID NO:2n-l , wherein n is an integer between 1 and 12; and a 
nucleic acid fragment wherein one or more nucleotides in the nucleotide sequence selected 
from the group consisting of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12 

10 is changed from that selected from the group consisting of the chosen sequence to a 
different nucleotide provided that no more than 15% of the nucleotides are so changed. 

In another embodiment, the invention includes an isolated nucleic acid molecule 
having a nucleic acid sequence encoding a polypeptide including an amino acid sequence 
selected from the group consisting of a mature form of the amino acid sequence given 

1 5 SEQ ID NO:2n, wherein n is an integer between 1 and 12, wherein the nucleic acid 

molecule hybridizes under stringent conditions to the nucleotide sequence selected from 
the group consisting of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, or a 
complement of the nucleotide sequence. 

In another embodiment, the invention includes an isolated nucleic acid molecule 

20 having a nucleic acid sequence encoding a polypeptide including an amino acid sequence 
selected from the group consisting of a mature form of the amino acid sequence given 
SEQ ID NO:2n, wherein n is an integer between 1 and 12, wherein the nucleic acid 
molecule has a nucleotide sequence in which any nucleotide specified in the coding 
sequence of the chosen nucleotide sequence is changed from that selected from the group 

25 consisting of the chosen sequence to a different nucleotide provided that no more than 
15% of the nucleotides in the chosen coding sequence are so changed, an isolated second 
polynucleotide that is a complement of the first polynucleotide, or a fragment of any of 
them. 

In another embodiment, the invention includes a vector involving the nucleic acid 
30 molecule having a nucleic acid sequence encoding a polypeptide including an amino acid 
sequence selected from the group consisting of a mature form of the amino acid sequence 
given SEQ ID NO:2n, wherein n is an integer between 1 and 12. This vector can have a 
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promoter operably linked to the nucleic acid molecule. This vector can be located within a 
cell. 

In another embodiment, the invention involves a method for determining the 
presence or amount of a nucleic acid molecule having a nucleic acid sequence encoding a 

5 polypeptide including an amino acid sequence selected from the group consisting of a 
mature form of the amino acid sequence given SEQ ID NO:2n, wherein n is an integer 
between 1 and 12 in a sample, the method including providing the sample; introducing the 
sample to a probe that binds to the nucleic acid molecule; and determining the presence or 
amount of the probe bound to the nucleic acid molecule, thereby determining the presence 

10 or amount of the nucleic acid molecule in the sample. The presence or amount of the 
nucleic acid molecule is used as a marker for cell or tissue type. The cell type can be 
cancerous. 

In another embodiment, the invention involves a method for determining the 
presence of or predisposition for a disease associated with altered levels of a nucleic acid 

15 molecule having a nucleic acid sequence encoding a polypeptide including an amino acid 
sequence selected from the group consisting of a mature form of the amino acid sequence 
given SEQ ID NO:2n, wherein n is an integer between 1 and 12 in a first mammalian 
subject, the method including measuring the amount of the nucleic acid in a sample from 
the first mammalian subject; and comparing the amount of the nucleic acid in the sample 

20 of step (a) to the amount of the nucleic acid present in a control sample from a second 
mammalian subject known not to have or not be predisposed to, the disease; wherein an 
alteration in the level of the nucleic acid in the first subject as compared to the control 
sample indicates the presence of or predisposition to the disease. 

The invention further provides an antibody that binds immunospecifically to a 

25 NOVX polypeptide. The NOVX antibody can be monoclonal, humanized, or a folly 
human antibody. Preferably, the antibody has a dissociation constant for the binding of 
the NOVX polypeptide to the antibody less than 1 x 10' 9 M. More preferably, the NOVX 
antibody neutralizes the activity of the NOVX polypeptide. 

In a further aspect, the invention provides for the use of a therapeutic in the 

30 manufacture of a medicament for treating a syndrome associated with a human disease, 
associated with a NOVX polypeptide. Preferably the therapeutic is a NOVX antibody. 
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In yet a further aspect, the invention provides a method of treating or preventing a 
NOVX-associated disorder, a method of treating a pathological state in a mammal, and a 
method of treating or preventing a pathology associated with a polypeptide by 
administering a NOVX antibody to a subject in an amount sufficient to treat or prevent the 
5 disorder. 

In another embodiment, the invention involves an immunoconjugate. The 
immunoconjugates can be an antibody conjugated to a cytotoxic agent such as a 
chemotherapeutic agent, a toxin (eg., an enzymatically active toxin of bacterial, fungal, 
plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a 

10 iadioconjugate). 

The invention further provides methods for treating cancer by administering one or 
more antibodies or immunoconjugates to a patient. The invention also provides methods 
for treating inflammation by administering one or more antibodies or one or more 
immunoconjugates to a patient. 

1 5 In another embodiment, the invention involves protein-protein complexes. These 

complexes can involve any one of polypeptides NOVla - NOV-ld complexed with any 
one of polypeptides NOV2a - NOV2h. Alternatively, the complex can involve a 
fragment of any one of polypeptides NOVla - NOV-ld complexed with a fragment of any 
one of polypeptides NOV2a - NOV2h. The fragments can consist of specific domains of 

20 the polypeptides (e.g. the PKD domain of NOV2a-NOV2h, the mucin domain of NOVla- 
NOVld, or the unglycosylated mucin domain of NOVla-NOVld). 

In another embodiment, the invention involves antibodies which 
immunospecifically bind to any of the NOV1-NOV2 complexes. 

In another embodiment, the invention involves an antibody which disrupts or 

25 neutralizes the interaction between the protein-protein complexes of the invention thereby 
disassociating any NOV1-NOV2 complex or preventing the formation of any NOV1- 
NOV2 complex. For example, antibodies which disrupt or neutralize the interactions 
between the NOV polypeptides include antibodies which immunospecifically bind to the 
PKD domain of NOV2, the mucin domain of NOV1, or the unglycosylated mucin domain 

30 ofNOVl. 

Unless otherwise defined, all technical and scientific terms used herein have the 

same meaning as commonly understood by one of ordinary skill in the art to which this 
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invention belongs. Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. All publications, patent applications, patents, 
and other references mentioned herein are incorporated by reference in their entirety. In 
the case of conflict, the present specification, including definitions, will control. In 
addition, the materials, methods, and examples are illustrative only and are not intended to 
be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram showing proteins that interact with CG57008-02 (HAVCR- 
1), including CG51373 (nephrin 1 like), and LOC155465. 

Figure 2 is a bar graph of the results of an ELISA assay of antibodies CR014.1 .29, 
CR04.2.56.2, CR014.2.59.2, and CR014.2.45.1 against CG57008-02. 

Figure 3 is a bar graph of the results of an ELISA assay of antibodies CR014.1 .29, 
CR04.2.56.2, CR014.2.59.2, and CR014.2.45.1 against irrelevant protein. 

Figures 4A and 4B are photographs showing staining of Renal Cell Cancer (left) 
and Pancreatic Cancer (right) with the anti-CG57008 2.59.2 monoclonal antibody. 

Figure 5 is a bar graph of Clonogenic Assay results of anti-CG57008 monoclonal 
antibody mediated toxin killing in the ACHN kidney cancer cell line. 

Figure 6 is a bar graph of Clonogenic Assay results of anti-CG57008 monoclonal 
antibody mediated toxin killing in the BT549 breast cancer cell line. 

Figure 7 is a bar graph of the results of an ELISA assay showing that Monoclonal 
Antibodies 2.59.2, 2.56.2 and 2.45.1 significantly inhibited IL-4 release from Thl cells 
compared to the control PK16.3 mAb. 

Figure 8 is a bar graph of the results of an ELISA assay showing that Monoclonal 
Antibodies 2.59.2, and 2.45.1 significantly inhibited IL-4 release from Th2 cells compared 
to control PK16.3 mAb. 
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Figure 9 is a bar graph of the results of an ELISA assay showing that Monoclonal 
Antibodies 2.59.2 significantly inhibited IL-5 release from Thl cells compared to control 
PK16.3 mAb. 

Figure 10 is a bar graph of the results of an ELISA assay showing that Monoclonal 
5 Antibodies 2.59.2, and 1.29 significantly inhibited IL-5 release from Th2 cells compared 
to control PK16.3 mAb. 

Figure 11 is a bar graph of the results of an ELISA assay showing that Monoclonal 
Antibodies 2.59.2, 1.29 and 2.56.2 significantly inhibited IL-10 release from Thl cells 
compared to control PK16.3 mAb. 

10 Figure 12 is a bar graph of the results of an ELISA assay showing that Monoclonal 

Antibodies 2.59.2, 1.29 and 2.45.1 significantly inhibited IL-10 release from Th2 cells 
compared to control PK16.3 mAb. 

Figure 13 is a bar graph of the results of an ELISA assay showing that Monoclonal 
Antibodies 2.59.2, 1.29 and 2.56.2 significantly inhibited IL-13 release from Thl cells 

15 compared to control PK16.3 mAb. 

Figure 14 is a bar graph of the results of an ELISA assay showing that Monoclonal 
Antibodies mAbs 2.59.2 and 1.29 significantly inhibited IL-13 release from Th2 cells 
compared to control PK16.3 mAb. 

Figure 15 is a bar graph of the results of an ELISA assay showing that Anti- 
20 CG57008-02 Monoclonal Antibodies did not inhibit IFNy release from Thl cells 
compared to control PK16.3 mAb. 

Figure 16 is a bar graph of the results of an ELISA assay showing that Monoclonal 
Antibodies 2.59.2 and 2.45.1 significantly inhibited IFNy release from Th2 cells compared 
to control PK1 6.3 mAb. 

25 Figure 17 is a bar graph of the results of a clonogenic assay of CAKI-1 cells 

treated with Auristatin E (AE) conjugated antibodies. 

Figure 18 is a bar graph of the results of a clonogenic assay of BT549 cells treated 
with Auristatin E (AE) conjugated antibodies. 
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DETAILED DESCRIPTION OF THE INVENTION 

HAVcr-1 (Hepatitis A Virus cellular receptor 1) is a putative adhesion protein 
involved in renal regeneration. The HAVcr-1 protein is a type 1 membrane protein that 
contains a novel six-cysteine immunoglobulin-like domain and a mucin domain. Ichimura 

5 et. al first identified this molecule, which they designated kidney injury molecule- 1 (KIM- 
1), as being expressed at low levels in normal kidney with dramatically increased 
expression in post-ischemic kidney. (J Biol Chem. 1998 Feb 13;273(7):4135-42.) 
Feigelstock demonstrated that KIM-1 is the human ortholog of Hepatitis A virus receptor 
(J Virol. 1998 Aug;72(8):6621-8.) In polycystic kidney disease, KIM-1 expression is 

10 strongly associated with partial dedifferentiation of epithelial cells (Kuehn EW. Am J 
Physiol Renal Physiol 2002 Dec;283(6):Fl 326-36). Shedding of a soluble domain of 
KIM-1 is proposed to constitute an active mechanism allowing dedifferentiated 
regenerating kidney cells to scatter (Bailly V. J Biol Chem 2002 Oct 18;277(42):39739- 
48). 

15 The HAVcr-1 protein has homology to P-type Trefoil* domains which have been 

shown to induce scattering and cellular invasion by kidney, colon and breast tumor cells. 
(Prest SJ. FASEB J. 2002 Feb 12). Trefoils may act via an anti-apoptotic mechanism by 
making cells resistant to anoikis, an anchorage-related apoptosis in epithelium. (Chen YH. 
Biochem Biophys Res Commun. 2000 Aug 1 1;274(3):576-82.) 

20 Monoclonal antibodies directed against HAVcr-1 protein should block migration 

and induce apoptosis of kidney cancer cells. Since HAVcr-1 gene appears to be highly 
induced in tumors, the HAVcr-1 protein could also be the target of toxin-conjugate 
monoclonal antibodies. 

Timl, a mouse ortholog of HAVcr-1, has been deposited in Genbank and 

25 identified as Tapr, a major T cell regulatory locus that controls the development of airway 
hyperreactivity. (Mclntire JJ. Nat Immunol. 2001 Dec;2(12):l 109-16.) The human 
HAVcr-1 gene also maps to a portion of Chromosome 5 {Tapr) that has been implicated in 
asthma. Timl is expressed by T cells and presumably interacts with an unknown ligand(s) 
on antigen presenting cells (Wills-Karp M. Nature Immunology 2, 1095 - 1096; Mclntire 

30 JJ. Nature Immunology 2, 1 1 09 - 1 1 1 6). 
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Antibodies to HAVcr-1 protein would send a partially activating signal that would 
lead to T cell anergy, act as an antagonist to block T cells from differentiating into Th2 
cells, or act as an agonist to trigger T cell differentiation. Blocking of Th2 effector T cell 
function would decrease inflammation associated with asthma, lupus, and emphysema. 

S The present invention provides novel nucleotides and polypeptides encoded 

thereby. Included in the invention are the novel nucleic acid sequences, their encoded 
polypeptides, antibodies, and other related compounds. The sequences are collectively 
referred to herein as "NOVX nucleic acids" or "NOVX polynucleotides" and the 
corresponding encoded polypeptides are referred to as "NOVX polypeptides" or "NOVX 

10 proteins." Unless indicated otherwise, "NOVX" is meant to refer to any of the novel 

sequences disclosed herein. Table A provides a summary of the NOVX nucleic acids and 



their encoded polypeptides. 

TABLE A. Sequences and Corresponding SEQ ID Numbers 



NOVX 
Assignment 


Internal 
Identification 


SEQ ID NO 
(nucleic acid) 


SEQ n> NO 
(amino acid) 


Homology 


NOVla 


CG57008-03 


1 


2 


Hepatitis A virus cellular 
receptor 1 - Homo sapiens 


NOVlb 


CG57008-01 


3 


4 


Hepatitis A virus cellular 
receptor 1 - Homo sapiens 


NOVlc 


CG57008-02 


5 


6 


Hepatitis A virus cellular 
receptor 1 - Homo sapiens 


NOVld 


CG57008-04 


7 


8 


Hepatitis A virus cellular 
receptor 1 - Homo sapiens 



NOV2a 


CG51373-12 


9 


10 


Sequence 1 from Patent 
WO0200691 - Homo 
sapiens 


NOV2b 


CG5 1373-02 


11 


12 


Sequence 1 from Patent 
WO0200691 - Homo 
sapiens 


NOV2c 


CG51373-03 


13 


14 


Sequence 1 from Patent 
WO0200691 - Homo 
sapiens 


NOV2d 


CG5 1373-07 


15 


16 


Sequence 1 from Patent 
WO0200691 - Homo 
sapiens 


NOV2e 


CG51373-10 


17 


18 


Sequence 1 from Patent 
WO0200691 - Homo 
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sapiens 


NOV2f 


CG5 1373-11 


19 


20 


Sequence 1 from Patent 
WO0200691 - Homo 
sapiens 


NOV2g 


CG51373-13 


21 


22 


Sequence 1 from Patent 
WO0200691 - Homo 
sapiens 


NOV2h 


CG51373-14 


23 


24 


Sequence 1 from Patent 
WO0200691 - Homo 
sapiens 



Table A indicates the homology of NOVX polypeptides to known protein families. 
Thus, the nucleic acids and polypeptides, antibodies and related compounds according to 
the invention corresponding to a NOVX as identified in column 1 of Table A are useful in 
5 therapeutic and diagnostic applications implicated in, for example, pathologies and 
disorders associated with the known protein families identified in column 5 of Table A. 

Polynucleotide and Polypeptide Sequences, and Homology Data 

NOV1: 

A NOV1 clone was analyzed, and the nucleotide and encoded polypeptide 
1 0 sequences are shown in Table 1 A. 



Table 1 A* NOV1 Sequence Analysis 


NOVla, CG57008-03 
DNA Sequence 


SEQIDNO: 1 


1017 bp 


ORF Start: at 1 


ORF Stop: end of sequence 



TCTGTAAAGGTTGGTGGAGAGGCAGGTCCATCTGTCACACT^ 

ATCAATGTGCTGGAATAGAGGCTCATGTTCTCTATTCACATGCCAAAATGGCATTGTCTGGACC 

GAACCCACGTCACCTATCGGAAGGACACACGCTATAAGCTATTGGGGGACCTTTCAAGAAGGGATGTC 

TCTTTGACCATAGAAAATACAGCTGTGTCTGACAGTGGCGTATATTGTTGCCGTGTTGAGCACCGTGG 

GTGGTTCAATGACATGAAAATCACCGTATCATTGGAGATTCT 

TTGTCACAACTGTTCCAACCGTCACGACTGTTCGAACGAGC 

CCAACGACAACTGTTCCAACAACAATGAGCATTCCAACGACAA 

TTCAACGACAACGAGCGTTCCAACGACAACGAGCATTCCAACAACAACAAGTC 

CGGTCTCTACCTTTGTTCCTCCAATGCCTTTGCCCAGGCAGAAC 

TCTTCACCTCAGCCAGCAGAAACCC71CCCTACGACACTGCAGGG 

CTCACCATTGTACTCTTACACAACAGATGGGAATGACACCGTGACAGAGTCTT 

ATAACAATCAAACTCAACTGTTCCTAGAACATAGTCTACTGACGGC 

GCTGGAGTCTGTATTTCTGTCTTGGTGCTTCTTGCTCTTTTGGGTCT 

CTTCAAAAAGGAGGTTCAACAACTAAGTGTTTCATTTAGCAGCCTTCAAATTAAA 

CAGTTGAAAAGGAAGTCCAAG CAGAAGACAATATCT ACATTGAGAATAGTCTTTATG CCACGGAC 



NOVla, CG57008-03 
Protein Sequence 


SEQIDNO: 2 


339 aa 


MW at 36608. lkD 


SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHVTyRKDTRYKLLGDLSRRDV 
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SLTIEOTAVSDSGVYCCRVEHRGWFNDMKITVSLEIV 

PTTTVPTTMS I PTTTTVPTTMTVSTTTSVPTTTS I PTTTS VPVTTTVSTFVP PMPLPRQNHEPVATSP 
SSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDTVTESSDGLWNNNQTQLFLEHSLLTANTTKGIY 
AGVC I SVLVLLALLGVI I AKKYFFKKEVQQLSVSFSSLQI KALQNAVEKEVQAEDNI YIENSLYATD 


NOVlb,CG57008-01 
DNA Sequence 


SEQIDNO:3 


1440 bp 


ORF Start: ATG at 52 


ORF Stop: end of sequence 



GTTACCCAGCATTGTGAGTGACAGAGCCTGGATCTGAACGCTGATCCCATAATGCATCCTCAAGTGGT 



CATCTOAAGCCTCATCCTACATCTGGCAGATTCTGTAGCTGGTTCTC 

GTCCATCTGTCACACTACCCTGCCACTACAGTGGAGCTGTCACATCAATGTGCTGGAATAGAGGCTCA 

TGTTCTCTATTCACATGCCAAAATGGCATTGTCTGGACCAATGGAACCCACGTCACCTATCGGAAGGA 

CACACGCTATAAGCTATTGGGGGACCTTTC^GAAGGGATGTCTCTTTGACCATAGAAAATACA 

TGTCTGACAGTGGCGTATATTGTTGCCGTGTTGAGCACCGTGGGTGGTTCAATGACATGAAAATCACC 

GTATCATTGGAGATTGTGCCACCCAAGGTCACGACTACTCCAATTGTCACAACTGTTCCAACCGTCAC 

GACTGTTCGAACGAGCACCACTGTTCCAACGACAACGACTGTTCCAACGACAACTGTTCCAACAACAA 

TGAGCATTCCAACGACAACGACTGTTCCGACGACAATGACTGTTTCAACGACAACGAGCG 

ACAACGAGCATTCCAACAACAACAAGTGTTCCAGTGACAACA^ 

GCCTTTGCCCAGGt^GAACCATGAACCAGTAGCCACTTCAC 

ACCCTACGACACTGCAGGGAGCAATAAGGAGAGAACCCACCAGCTCACCA^ 

GATGGGAATGACACCGTGACAGAGTCTTCAGATGGCCTTTGGAATAACAATCAAACTCAACTGTTCCT 

AGAAC^TAGTCTACTGACGGCCAATACCACTAAAGGAATCTATGCTGGAG 

TGCTTCITGCTCTTTTGGGTGTCATCATTGCCAAAAA 

AGTGTTTCATTTAGCAGCCTTCAAATTAAAGCTTTGCAAAATGCAGTTGAAAA 

AGACAATAT CTACATTGAGAATAGTCTTTATGC CACGGACTAAGA C CCAGTGGTGCT CTTTGAGAGTT 



TACGCCCATGACTGCAGAAGACTGAACAGGTATCAGCACATCAGATGTCTTTTAGACTCCAAGACAAT 



TTTTCTGTTTCAGTTTCATCTGGCATTCCAACATGTCAGTGATACTGGGTAGAGTAACTCTCCCACTC 



CAAACTGTGTATAGTCAACCTCATCATTAATGTAGTCCTAATTTGTTTTGCTAAAACTGGCTCAATCC 



TTCTGATCATTGCAGAGTTTTCTCrrCAAACATGAACACTTTAGAATTGTATGTTCTCTTTAGACCCCA 



TAAATCCTGTAT 



NOVlb, CG57008-01 
Protein Sequence 



SEQIDNO:4 



359 aa 



MWat38703.6kD 



MHPQWILSLILHLADSVAGSVKVGGE1AGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTH 
VTYRKiyrRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPKVT T^ 
TVPTVTTVRTST WPTTTTVPTTTVPTTMS I PTTTTVPTTMTVSTTTS VPTTTS I PTTTS VP VTTTVS 
TFVPPMPLPRQNHEPVATSPSSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGND1VTESSDGLWNNN 
QTQLFLEHSLLTANTTKGI YAGVCI SVLVLLALLGVI I AKKYFFKKEVQQLSVS FS SLQ I KALQNAVE 
KEVQ AEDN I Y I EN SLYATD 



NOVlc, CG57008-02 
DNA Sequence 



SEQIDNO:5 



ORF Start: at 1 



789 bp 



ORF Stop: end of sequence 



TCTGTAAAGGTTGGTGGAGAGGCAGGTCCATCTGTCACACTACCCTGCCACTACAGTGGAGCTGTCAC 
ATCAATGTGCTGGAATAGAGGCTCATGTTCTCTATTCACATGCCAAAATC 
GAACCCACGTCACCTATCGGAAGGACAGACGCTATAAGCTATTGGGGGACCT^ 
TCTTTGACCATAGAAAATACAGCTGTGTCTGACAGTGGCGTATATTGTTGCCtf 

GTGGTTCAATGACATGAAAATCACCGTATCATTGGAGATTGTGCCACCCAAGGTCACGACTACTCCAA 

TTGTCACAACTGTTCCAACCGTCACGACTGTTCGAACGAGCACCACTGTTCCAACGACAA 

CGAACGACAACTGTTCCAACAACAATGAGCATTCCAACGACAAC^ 

TTCAACGACAACGAGCGTTCCAACGACAACGAGCATTCCAACAACAACAAGTGTTC 

CGGTCTCTACCTTTGTTCCTCGAATGCCTTTGCCCAGGCAGAACCATC 

TCTTCACCTCAGCCAGCAGAAACCCACCCTACGACACTGCAGGGAGCAATAAGGAGAGAACCC^ 
CrCACCATTGTACTCTTACACAACAGATGGGAATGACACCGTGACAG^ 

ATAACAATCAAACTCAACTGTTCCTAGAACATAGTCTACTG 



NOVlc, CG57008-02 
Protein Sequence 



SEQIDNO:6 



263 aa MW at 28270.5kD 
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SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNG 

SLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPKVTTTPIVTIVPTVTTTO 
PTTTVPITMSIPTTTTVPTTMTVSTTTSVPTTTSIPT^ 

SSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDTVTESSDGLWNNNQTQLFLEHSLL 


NOVld, CG57008-04 
DNA Sequence 


SEQ ED NO: 7 343 bp 


ORF Start: at 1 1 |ORF Stop: end of sequence 



CACCGGATCCTCTGTAAAGGTTGGTGGAGAGGCAGGTCCATCTGTCACACTACCCTGCCACTACAGTG 



GAGCTGTCACATCAATGTGCTGGAATAGAGGCTCATGTTCTCTATTCACATGCCAAAATGGCATTGTC 
TGGACCAATGGAACCCACGTCACCTATCGGAAGGACACACGCTATAAGCTATTGGGGGACCTTTCAAG 
AAGGGATGTCTCTTTGACCATAGAAAATACAGCTGTGTCTGACAGTGGCGTATATTGTTGCCGTGTTG 
AGCACCGTGGGTGGTTCAATGACATGAAAATCACCGTATCATTGGAGATTGTGCCACCCAAGGTCGAC 
GGC 



NOVld, CG57008-04 
Protein Sequence 



SEQ ID NO: 8 



108 aa 



MW at!1916.5kD 



S VKVGGEAGPSVTL PCHYSGAVTSMCWNRGSCSLFTCQNG I VWTNGTHVTYRKDTRY KLLGDLSRRDV 
SLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPK 



A ClustalW comparison of the above protein sequences yields the following 
sequence alignment shown in Table IB. 



Table IB. Comparison of the NO VI protein sequences. 



NOVla SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNG 

NOVlb MHPQWILSLILHLADSVAGSVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNG 

NOV1 C -- - S VKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNG 

NOVld SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNG 

NOVla IWTNGraVTYRKDTRYKLLGDLSRRDVSLTIEN 

NOVlb IWTNGTHVTYRKDTRYKLIX3DLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITV 

NOVlc IWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITV 

NOVld IVWTNGTHVTYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDM 

N0V1 a SLEI VPPKVTTTP I VTTVPTVTTVRTSTTVPTTTTVPTTTVPTTMS I PTTTTVPTTMTVS 

NOVlb SLEI VPPKVTTTP I VTWPTVTTVRTSTTVPTTTTVPTTTVPTTMS I 

NOVlc S LE I VP P KVTTTP I VTTVPTVTTVRTSTTVPTTTTVPTTTVPTTMS I PTTTTVPTTMTVS 

NOVld SLEIVPPK- - 

N0V1 a TTTSVPTTTS I PTTTS VP VTTTVSTFVPPMPLPRQNHE PVATS PSS PQP AETHPTTLQGA 

NOVlb TTTSVPTTTS I PTTTSVPVTTTVSTFVPPMPLPRQNHE PVATS PSS PQP AETHPTTLQGA 

NOVlc TTTSVPTTTS I PTTTSVPVTTTVSTFVPPMPLPRQNHE PVATS PSS PQP AETHPTTLQGA 

NOVld 

NOVla I RREPTSSPLYSYTTDGNDTVTESSDGLWNNNQTQLFLEHSLLTANTTKGI YAGVCI SVL 

NOVlb I RREPTSSPLYSYTTDGNDTVTESSDGLWNNNQTQLFLEHSLLTANTTKGI YAGVCI SVL 

NOVlc IRREPTSSPLYSYTTIX3NDTVTESSDGLWNNNQTQLFLEHSLL- - 

NOVld 

NOVla VLLALLGVI I AKKY FFKKEVQQLSVS FSSLQI KALQNAVEKEVQAEDNI Y I ENSLYATD 

NOVlb VLLALLGVI I AKKYFFKKEVQQLSVS FSSLQI KALQNAVEKEVQAEDNI Y I ENSLYATD 

NOVlc - " 
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NOVld 


















NOVla 


(SEQ 


ID 


NO: 


2) 


NOVlb 


(SEQ 


ID 


NO: 


4) 


NOVlc 


(SEQ 


ID 


NO: 


6) 


NOVld 


(SEQ 


ID 


NO: 


8) 



Further analysis of the NOVla protein yielded the following properties shown in 
Table 1C. 



Table 1C. Protein Sequence Properties NOVla 



SignalP analysis: 



No Known Signal Sequence Predicted 



PSORT II analysis: 



PSG: a new signal peptide prediction method 

N- region: length 7; pos.chg 1; neg.chg 1 
H- region: length 21; peak value 7.47 
PSG score: 3.07 

GvH: von Heijne's method for signal seq. recognition 
GvH score (threshold: -2.1) : -7.40 
possible cleavage site: between 24 and 25 

»> Seems to have no N-terminal signal peptide 

ALOM: Klein et al's method for TM region allocation 
Init position for calculation: 1 

Tentative number of TMS(s) for the threshold 0.5: 1 
Number of TMS(s) for threshold 0.5: 1 

INTEGRAL Likelihood =-14.49 Transmembrane 275 - 291 
PERIPHERAL Likelihood = 6.58 (at 176) 
ALOM score: -14.49 (number of TMSs: 1) 

MTOP: Prediction of membrane topology (Hartmann et al . ) 
Center position for calculation: 282 
Charge difference: 2.5 C( 3.0) - H( 0.5) 
C > N: C-terminal side is inside 

>»Caution: Inconsistent mtop result with signal peptide 
>>> Single TMS is located near the C-terminus 

»> membrane topology: type Nt (cytoplasmic tail 1 to 274) 

MITDISC: discrimination of mitochondrial targeting seq 
R content: 2 Hyd Moment (75) : * 7.31 

Hyd Moment(95): 9.99 G content: 7 
D/E content: 2 S/T content: 12 

Score: -4.31 

Gavel: prediction of cleavage sites for mitochondrial preseq 
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R-2 motif at 62 YRK | DT 

NUCDISC: discrimination of nuclear localization signals 
pat 4 : none 
pat 7: none 
bipartite: none 

content of basic residues: 6.8% 
NLS Score: -0.47 

KDEL : ER retention motif in the C- terminus: none 

ER Membrane Retention Signals: none 

SKL: peroxisomal targeting signal in the C- terminus: none 

PTS2 : 2nd peroxisomal targeting signal : none 

VAC: possible vacuolar targeting motif: none 

RNA-binding motif: none 

Actinin-type actin-binding motif: 
type 1: none 
type 2 : none 

NMYR: N-myristoylation pattern : none 

Prenylation motif: none 

memYQRL : transport motif from cell surface to Golgi: none 

Tyrosines in the tail: too long tail 

Dileucine motif in the tail: found 
LL at 59 
LL at 262 

checking 63 PROSITE DNA binding motifs: none 

checking 71 PROSITE ribosomal protein motifs: none 

checking 33 PROSITE prokaryotic DNA binding motifs: none 

NNCN: Reinhardt's method for Cytoplasmic/Nuclear discrimination 
Prediction: nuclear 
Reliability: 55.5 

COIL: Lupas's algorithm to detect coiled-coil regions 
total: 0 residues 
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Final Results (k = 9/23) : 

30.4 %: nuclear 

21.7 %: cytoplasmic 

13.0 %: mitochondrial 

13.0 %: Golgi 

8.7 %: vesicles of secretory system 

8.7 %: endoplasmic reticulum 

4.3 % : peroxisomal 

>> prediction for CG57008-03 is nuc (k=23) 



A search of the NOVla protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 



several homologous proteins shown in Table ID. 



Table ID. Geneseq Results for NOVla 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOVla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW38336 


Human kidney injury related 
molecule (KIM) - Homo sapiens, 
334 aa. [WO9744460-A1, 27- 
NOV-1997] 


1..303 
21. .323 


303/303 (100%) 
303/303 (100%) 


e-177 


AAR92803 


Hepatitis A virus receptor - 
Cercopithecus aethiops, 451 aa. 
[WO9604376-A1, 15-FEB-1996] 


1..337 
21. .437 


264/417 (63%) 
302/417(72%) 


e-145 


AAW38334 


Rat kidney injury related 
molecule (KIM) - Rattus sp, 307 
aa. [WO9744460-A1, 27-NOV- 
1997] 


4..331 
25..298 


135/332(40%) 
177/332 (52%) 


8e-52 


AAM39027 


Human polypeptide SEQ ID NO 
2172 - Homo sapiens, 378 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


9..214 
33..222 


86/209 (41%) 
113/209(53%) 


3e-32 


AAY25768 


Human secreted protein encoded 
from gene 58 - Homo sapiens, 379 
aa. [W09938881-A1, 05-AUG- 
1999] 


9..214 
33..222 


86/209 (41%) 
113/209(53%) 


3e-32 



5 In a BLAST search of public sequence databases, the NOVla protein was found to 



have homology to the proteins shown in the BLASTP data in Table IE. 
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Table IE. Public BLASTP Results for NOVla 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOVla 
Residues/ 
Match 

U act fl 11 etc 

nesmues 


Identities/ 
Similarities for 
the Matched 

Pnrtinn 


Expect 
Value 


043656 


Hepatitis A virus cellular receptor 1 
- Homo sapiens (Human), 359 aa. 


1..339 
21..359 


339/339 (100%) 
339/339 (100%) 


0.0 


Q96D42 


Hypothetical protein - Homo 
sapiens (Human), 364 aa. 


1..339 
21..364 


338/344 (98%) 
338/344 (98%) 


0.0 


046598 


Hepatitis A vims cellular receptor 1 
long form (Hepatitis A virus cellular 
receptor 1 short form) - 
Cercopithecus aethiops (Green 
monkey) (Grivet), 478 aa. 


1..337 
26..464 


261/439 (59%) 
301/439 (68%) 


e-139 


018984 


Hepatitis A vims receptor - 
Cercopithecus aethiops (Green 
monkey) (Grivet), 460 aa. 


99..337 
209..446 


178/239(74%) 
203/239(84%) 


6e-99 


046597 


Hepatitis A vims cellular receptor 1 
long form (Hepatitis A vims cellular 
receptor 1 short form) - 
Cercopithecus aethiops (Green 
monkey) (Grivet), 474 aa. 


99.337 
223..460 


177/239(74%) 
205/239 (85%) 


le-98 



PFam analysis predicts that the NOVla protein contains the domains shown in the 



Table IF. 



Table IF. Domain Analysis of NOVla 


Pfam Domain 


NOVla Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


>g 


9..87 


19/84 (23%) 
54/84 (64%) 


0.0059 



5 



24 



WO 03/080856 



PCT/US03/08490 



A full-length human kidney injury related molecule was recently cloned and is 
described in WO97/44460, which is incorporated by reference in its entirety. 



Table 1G. Brief description of relevant variants 



CGUID-Variant 


Description 


Size 


CG57008-01 
(NOVlb) 


Full length protein 


359 aa 


CG57008-02 
(NOVlc) 


Soluble extracellular domain 


263 aa 


CG57008-03 
(NOVla) 


Mature Protein (first 20aa comprising the signal sequence 
has been removed ) 


339 aa 


CG57008-04 
(NOVld) 


N-terminal extracellular fragment 


108aa 



5 NOV2: 

A NOV2 clone was analyzed, and the nucleotide and encoded polypeptide 



sequences are shown in Table 2A. 



Table 2A. NOV2 Sequence Analysis j 




SEQIDNO:9 J2336 bp f 


NOV2a, 
CG51373-12 
DNA Sequence 


CACCAAGCTTATGCTGAGCCTCCTCGTCTGGATCCT 

CAAGGGACCCAGACCCGCTTCAGCCAGGAGCCAGCTGACCAGACGGTGGTGGCTGGAC 
AGCGGGCCGTGCTCCCCTGTGTGCTGCTCAACTACTCTGGAATTGTGCAATGGACCAA 
GGACGGGCTGGCCCTGGGC^TGGGCCAGGGCCTCAAAGCCTGGCCACGGTACCGGGTT 
GTGGGCTCCGCAGACGCTGGGCAGTACAACCTGGAGATCACAGATGCTGAGCTCTCTG 
ACGACGCCTCTTACGAGTGCCAGGCCACGGAGGCCGCCCTGCGCTCTCGGCGGGCCAA 
ACTCACCGTGCTCATCCCCCCAGAGGACACCAGGATTGACGGAGGCCCTGTGATTCTA 
CTGCAGGCAGGCACCCCCCACAACCTCACATGCCGGGCCTTCAATGCGAAGCCTGCTG 
CCACCATCATCTGGTTCCGGGACGGGACGCAGCAGGAGGGCGCTGTGGCCAGCACGGA 
ATTGCTGAAGGATGGGAAGAGGGAGACCACCGTGAGCCAACTGCTTATTAACCCCACG 
GACCTGG ACATAGGGCGTGTCTTCACTTGC CGAAGC ATGAACGAAGCCAT CCCTAGTG 
GCAAGGAGACTTCCATCGAGCTGGATGTGCACCACCCTCCTACAGTGACCCTGTCCAT 
TGAGCCACAGACGGTGCAGGAGGGTGAGCGTGTTGTCTTTACCTGCCAGGCCACAGCC 
AACCCCGAGATCTTGGGCTACAGGTGGGCCAAAGGGGGTTTCTTGATTGAAGACGCCC 
ACGAGAGTCGCTATGAGACAAATGTGGATTATTCCTTTTTCACGGAGCCTGTGTCTTG 
TGAGGTTCACAACAAAGTGGGAAGCACCAATGTCAGCACTTTAGTAAATGTCCACTTT 
GCTCCCCGGATTGTAGTTGACCCCAAACCCACAACCACAGACATTGGCTCTGATGTGA 
CCCITACCTGTGTCIXK^TTGGGAATCCCCCCCTCACTCTCACCTGGACCAAAAAG^ 
CTCAAATATGGGGCCCAGGCCTCCTGGCTCCCCACCCGAGGCT 

GTCCTGAGTAACAGCAACCAGCTGCTGCTGAAGTCGGTGACTCAGGCAGACGCTGGCA 

CCTACACCTGCCGGGCCATCGTGCCTCGU^TCGGAGTGGCTGAGCGGGAGGTGCCGCT 

CTATGTGAACGGGCCCCCCATCATCTCCAGTGAGGCAGTGCAGTATGCTGTGAGGGGT 

GACGGTGGCAAGGTGGAGTGTTTCATTGGGAGCACACCACCCCCAGACCGCATAGCAT 

GGGCCTGGAAGGAGAACTTCTTGGAGGTGGGGACCCTGGAACGCTATACAGTGG 

GACCAACTCAGGCAGTGGGGTGCTATCCACGCTCACCATCAACAATGTCATGGAOT 

GACTTTCAGACTCACTACAACTGCACCGCCTGGAACAGCTTCGGGCC^ 

TCATCCAGCTGGAAGAGCGAGAGGTGTTACCTGTGGGCATCATAGCTGGGGCCACCAT 

CGGCGCGAGCATCCTGCTCATCTTCTTCTTCATCGCCTTGGTATTCTTCCT 
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CGCCGCAAAGGCAGTCGCAAAGACGTGACCCTGAGGAAGCTGGATATCAAC3GTGGAGA 
CAGTGAACCGAGAGCCACTTACGATGCATTCTGACCGGGAGGATGACACCGCCAGCGT 
CTCCACAGCAACCCGGGTCATGAAGGCCATCTACTCGTCGTTTAAGGATGATGTGGAT 
CTGAAGCAGGACCTGCGCTGCGACACCATCGACACCCGGGAGGAGTATGAGATGAAG6 
ACCCCACCAATGGCTACTACAACGTGCX5TGCCCATGAAGACCGCCCGTCTTCCAGGGC 
AGTGCTCTATGCTGACTACCGTGCCCCTGGCCCTGCCCGCTTCGACGGCCGCCCCTCA 
TCCCGTCTCTCCCACTCCAGCGGCTATGCCCAGCTCAACACCTATAGCCGGGGCCCTG 
CCTCTGACTATGGCCCTGAGCCCACACCCCCTGGCCCTGCTGCCCCAGCTGGCACTGA 
CACAACCAGCCAGCTGTCCTACGAGAACTATGAGAAGTTCAACTCCCATCCCTTCCCT 
GGGGCAGCTGGGTACCCCACCTACCGACTGGGCTACCCCCAGGCCCCACCCTCTGGCC 
TGGAGCGGACCCCATATGAGGCGTATGACCCCATTGGCAAGTACGCCACAGCCACTCG 
ATTCTCCTACACCTCCCAGCACTCGGACTACGGCCAGCGATTCCAGCAGCGCATGCAG 
ACTCACGTGGTCGACG 




ORF Start: ATG at 11 


|ORF Stop: at 2330 j 




SEQ ID NO: 10 


773 aa |mW at 85048.5kD 


NOV2a, 
CG5 1373-12 
Protein Sequence 


MLSLLWILTLSDTFSQGTQTRFSQEPADQTWAGQRAVLPCVLLNYSGIVQWTKE)GL 
ALGMGQGL KAW PRYRWGS AD AGQ YNLE ITDAE LSDDAS YECQ ATEAALRSRRAKLTV 
LI PPEDTRIDGGPVILLQAGTPHNLTCRAFNAKPAATI I WFRDGTQQEGAVASTELLK 
DGKRETTVSQLL INPTDLD I GR VFTCRSMNE AI P SGKETS I ELDVHH P PTVTLS I E PQ 
TVQEGERWFTCQATANPE I LGYRWAKGG FLI E DAHE SRYETNVDYS FFTE PVS CE VH 
NKVGSTNVSTLVNVHFAPRIVVDPKPTTTDIGSDVT^ 

GPRPPGSPPEAALSAQVLSNSNQLLLKSVTQADAGTYTCRAIVPRIGVAEREVPLYVN 
GPPIISSEAVQYAVRGDGGKVECFIGSTPPPDRIAWAWKENFLEVGTLERYTVERTNS 
GSGVL STLTI NNVMEADFQTH YNCTAWNS FGPGTAI I QLEEREVLPVG 1 1 AGAT I GAS 
ILLIFFFIALVFFLYRRRKGSRKDVTLRKLDIKVETVNREPLTMHSDREDDTASVSTA 
TRVMKAIYSSFKDDVDLKQDLRCDTIDTREEYEMKDPTNGYYNVRAHEDRPSSRAVLY 
ADYRAPGPARFDGRPSSRLSHSSGYAQLNTYSRGPASDYGPEPTPPGPAAPAGTDTTS 
QLSYENYEKFNSHPFPGAAGYPTYRLGYPQAPPSGLERTPYEAYDPIGKYATATRFSY 
TSQHSDYGQRFQQRMQTHV 




SEQ ID NO: 11 


|3464 bp | 


N0V2b, 
CG51373-02 
DNA Sequence 


ATGCATTTGACTCTGGAAGTCTTAAACCATGGCCCCTTCCCTCTAAACCTTTCCTCCA 

TTGCTTACAATCATGGAACTGTGTTTGGCCACTGGAAGAATAACGTCACTCGGGAAAC 

GCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAA 

CTGGATCTCAACAGCGGGAAGGAAACATTTCTGGTGAATGAGGAGGCAACGGGCGAGA 

CCTCAGGAGACAATGTTGTTCATTCTAGGAATCTGTCTCAGACAATCTTCATCACCCG 

GAAACGATGGGAGGGGACCCAGACCCGCTTCAGCCAGGAGCCAGCTGACCAGACGGTG 

GTGGCTGGACAGCGGGCCGTGCTCCCCTGTGTGCTGCTCAACTACTCTGGAATTGTGC 

AATGGACCAAGGACGGGCTGGCCCTGGGCATGGGCCAGGCCCTCAAAGCCTGGCCACG 

GTACCGGGTTGTGGGCTCCGCAGACGCTGGGCAGTACAACCTGGAGATCACAGATGCT 

GAGCTCTCTGACGACGCCTCTTACGAGTGCCAGGCGACGGAGGCCGCCCTC 

GGCGGGCCAAACTCACCGTGCTCATCCCCCCAGAGGACACCAGGATTGACGGAGGCCC 

TGTGATTCTACTGCAGGCAGGCACCCCCCACAACCTCACATGCCGGGCCTTCAATGCG 

AAGCCTGCTGCCACCATCATCTGGTTCCGGGACGGGACGCAGCAGGAGGGCGCTGTGG 

CCAGCACGGAATTGCTGAAGGATGGGAAGAGGGAGACCACCGTGAGCCAACTGCTTAT 

TAACCCCACGGACCTGGACATAGGGCGTGTCTTCACTTGCCGAAGCATCAACGAAGCC 

ATCCCTAGTGGCAAGGAGACTTCCATCGAGCTGGATGTGCACCACCCTCCTACAGTGA 

CCCTGTCCATTGAGCCACAGACGGTGCAGGAGGGTCAGCGTGTTGTCTTTACCTGCCA 

GGCCACAGCCAACCCCGAGATCTTGGGCTACAGGTGGGCCAAAGGGGGTTTCTTGATT 

GAAGACGCCCACGAGAGTCGCTATGAGACAAATGTGGATTATTCCTTTTTCACGGAGC 

CTGTGTCTTGTGAGGTTCACAACAAAGTGGGAAGCACCAATGTCAGCACTTTAGTAAA 

TGTCCACTTTGCTCCCCGGATTGTAGTTGACCCCAAACCCACAACCACAGACATTGGC 

TCTGATGTGACCCTTACCTGTGTCTGGGTTGGGAATCCCCCCCTCACTCTCACCTGGA 

CCAAAAAGGACTCAAATATGGTCCTGAGTAACAGCAACCAGCTGCTGCTGAAGTCGGT 

GACTCAGGCAGACGCTGGCACCTACACCTGCCGGGCCATCGTGCCTCG 

GCTGAGCGGGAGGTGCCGCTCTATGTGAACGGGCCCCCCATCATCTCCAGTGAGGCAG 
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TGCAGTATGCTGTGAGGGGTGACGGTGGCAAGGTGGAGTGTTTCATTGGGAGCACACC 
AC C CCCAG AC CGCATAG CATGGGC CTGG AAGGAGAACTTCTTGGAGGTGGGG ACC CTG 
GAACGCTATACAGTGGAGAGGACCAACTCAGGCAGTGGGGTGCTATCCACGCTCACCA 
TCAACAATGTCATGGAGGCCGACTTTCAGACTCACTACAACTGCACCGCCTGGAACAG 
CTTCGGGCCAGGCACAGCCATCATCCAGCTGGAAGAGCGAGAGGTGTTACCTGTGGGC 




TGK5TATTCTTCCTCTACCGGCGCCGCAAAGGCAGTCGCAAAGACGTGACCCTGAGGAA 
GCTGGATATCAAGGTGGAGACAGTGAACCGAGAGCCACTTACGATGCATTCTGACCGG 
GAGGATGACACCGCCAGCGTCTCCACAGCAACCCGGGTCATGAAGGCCATCTACTCGT 
CGTTTAAGGATGATGTGGATCTGAAGCAGGACCTGCGCTGCGACACCATCGACACCCG 
GGAGGAGTATGAGATGAAGGACCCCACCAATGGCTACTACAACGTGCGTGCCCATGAA 
GACCGCCCGTCTTCCAGGGCAGTGCTCTATGCTGACTACCGTGCCCCTGGCCCTGCCC 
GCTTCGACGGCCGCCCCTCATCCCGTCTCTCCCACTCCAGCGGCTATGCCCAGCTCAA 
CACCTATAGCCGGGGCCCTGCCTCTGACTATGGCCCTGAGCCCACACCCCCTGGCCCT 
GCTGCCCCAGCTGGCACTGACACAACCAGCCAGCTGTCCTACGAGAACTATGAGAAGT 
TCAACTCCC^TCCCTTCCCTGGGGCAGCTGGGTACCCCACCTACCGACTGGGCTACCC 
CCAGGCCCCACCCTCTGGCCTGGAGCGGACCCCATATGAGGCGTATGACCCCATTGGC 
AAGTACGCCACAGCCACTCGATTCTCCTACACCTCCCAGCACTCGGACTACGGCCAGC 
GATTCCAGCAGCGCATGCAGACTCACGTGTAGGGGCCAGAGCCTGGCTGGGGCATCTC 


TGCGGGGCAGAGGAGAAGGCTTTCGCAGCTGTTCCCTGATATTCAGGGACATTGCTCA 


TTGCTCCCTTCTCGGACCAGCCTTCTTCCTCCCACCATGGCAGGTGGGGAGCAGGTCT 


CCCAGAGACACCCCGTCCCGAGGATGGTGCTCTX3TGCATGCCCCAGCCTCCTGGGCCT 




TAGAATGGGGACAGGGAAAGTGAAGGTTAGGGAAAGCAGAGGGGGGCAC'ri"l"lTAGCA 


TTCCCTTTCTATCCCACCCCTCTGATCTCCCATAAGTGGAAATGGGGGTACCCAGGGA 


TGGGCAGGCTTTGGCCTAGGGACATGAAGTATGGGAGTGGGTGGCTGTGGCACAGACA 


GGTGGAAAACGGGATAGCCTGGCCAGTCCCTCTGTTGTCTGCATTCGTGCCCTGGGTG 


CCTCTCTCCTTCCTCAGGGTACTGCAGAAGGGAGCGAACAGGGTACTGTTCGCTCTTG 


TCTACAGAACAGCCCTGGCACTGCATTCAAATCCAGTCTTCATTCAGCTGGGATCAAA 


ATGCCAGTCACCTTGGCTACCCACTCTGGACAGCTGTCTGTCAGCATGCAGAGGGATC 


CAGGAATCCCCCCGGCAGCACGGCCCGCTTTCCnTCTCCTCCATGCTGGGCCAGCCAG 


ATAAGTCAGGGTCCTGGTGGAGAAAGAAAGGCTAGGACCATGTCCTCATTGACCCAGA 


TACTGCTGTGTGCTGCACAGCAGTGAACCAACACTAGAGGGAGCCACACAAGCCTCCT 


CTCCCCAGTCTGCCCCACTTCCTGGCTTTAACT 


GGTAGGGGTGGGGGTGCTGTAGGCTCTTTTTCAAAAAAAAAC 




ORF Start: ATG at 1 1) ORF Stop: TAG at 2524 




SEQ ro NO: 12 |841 aa MW at 92988.2kD 


NOV2b, 
CG51373-02 
Protein Sequence 


MHLTLEVLNHGPFPLNLSSIAYNHGTVFGHWKNNVTRETLVKVKDM 

IxDI^SGKETFLVNEEATGBTSGDNVVHSRl^ 

VAGQRAVLPCVLLITOSGIVQWTKDGIVAI^ 

ELSDDASYECQATEAALRSRRAKLTVLIPPEIH^IDGGPVIIJjQAGTPHNLTCRAFNA 

KPAATIIWFRIX5TQQEGAVASTELLKDGKRETTVSQLLINPTDLDIGRVFTCRSMNEA 

I PSGKETS IELDVHH PPTVTLS I E PQTVQEGERWFTCQATANPE ILG YRWAKGGFLI 

EDAHESRYETNVDYSFFTEPVSCEVHNKVGSTNVSTLVNVHFAPRIVVDPKPTTTDIG 

SDVTLTCVWVGNPPLTLTWTKKDSNTWLSNSNQLLLKSVTQADAGTYTCRAIVPRIGV 

AEREVPLYVNGPPIISSEAVQYAVRGDGGKVECFIGSTPPPDRIAWAWKENFLEVGTL 

ERYTVERTNSGSGVLSTLTINNVMEADFQTHYNCTAWNSFGPGTAIIQLEBREVLPVG 

IIAGATIGASILLIFFFIALVFFLYRRRKGSRKDVTLRIQjDIKVETVNREPLTMHSDR 

EDDTASVSTATRVMKAIYSSFKDDVDLKQDLRCDTIDTREEYBMKDPTO 

DRPSSRAVLYADYRAPGPARFDGRPSSRLSHSSGYAQLNTYSRGPASDYGPEPTPPGP 

AAPAGTDTTSQLSYENYBKFNSHPFPGAAGYPTYRLGYPQAPPSGLERTPYEAYDPIG 

KYATATRFSYTSQHSDYGQRFQQRMQTHV 




SEQ ID NO: 13 3379 bp 


NOV2c, 


CTCTCCGATACTTTCTCCCAAGGGTCAGCTGCTTCTTCATTCCAAGTGGACAAGGAGC 


C^GCTGCTC^CTGTCCTTGAGAGACTTCAGCGAGAGACCAGGGTGTCCAGGCTCC^TC 
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CG5 1373-03 
DNA Sequence 



CAGGAAAGCCATGCGTATAAATTCCACCTCTGAGCCAGGCCTCACCAGCAAGCCCACT 



CTTAAGCCCTTGACTTGGGCTCCAGGGGCCATGGGAAGGAGAAACGGACCCAGACCCG 



CTTCAGCCAGGAGCCAGCTGACCAGACGGTGGTGGCTGGACAGCGGGCCGTGCTCCCC 



TGTGTGCTGCTCAACTACTCTGGAATTGTGCAATGGACCAAGGACGGGCTGGCCCTGG 



GCATGGGCCAGGCCCTCAAAGCCTGGCCACGGTACCGGGTTGTGGGCTCCGCAGACGC 
TGGGCAGTACAACCTGGAGATCACAGATGCTGAGCTCTCTGACGACGCCTCTTACGAG 
TGCCAGGCCACGGAGGCCGCCCTGCGCTCTCGGCGGGCCAAACTCACCGTGCTCATCC 
CC C CAGAGGACAC CAGGATTG ACGG AGGC C CTGTG ATT CT ACTG CAGG C AGGCACC C C 
CCACAACCTCACATGCCGGGCCTTCAATGCGAAGCCTGCTGCCACCATCATCTGGTTC 
CGGGACGGGACGCAGCAGGAGGGCGCTGTGGCCAGCACGGAATTGCTGAAGGATGGGA 
AGAGGGAGACCACCGTGAGCCAACTGCTTATTAACCCCACGGACCTGGACATAGGGCG 
TGTCI^CACTTGCCGAAGCATGAACGAAGCCATCCCTAGTGGCAAGGAGACTTCCATC 
GAGCTGGATGTGCACCACCCTCCTACAGTGACCCTGTCCATTGAGCCACAGACGGTGC 
AGGAGGGTGAGCGTGTTGTCTTTACCTGCCAGGCCACAGCCAACCCCGAGATCTTGGG 
CTACAGGTGGGCCAAAGGGGGTTTCTTGATTGAAGACGCCCACGAGAGTCGCTATGAG 
ACAAATGTGGATTATTCCTTITTCACGGAGCCTGTGTCTTGTGAGGTTCACAACAAAG 
TGGGAAGCACCAATGTCAGCACTTTAGTAAATGTCCACTTTGCTCCCCGGATTGTAGT 
TGACCCCAAACCCACAACCACAGACATTGGCTCTGATGTGACCCTTACCTGTGTCTGG 
GTTGGGAATCCCCCCCTCACTCTCACCTGGACCAAAAAGGACTCAAATATGGTCCTGA 
GTAACAGCAACCAGCTGCTGCTGAAGTCGGTGACTCAGGCAGACGCTGGCACCTACAC 
CTGCCGGGCCATCGTGCCTCGAATCGGAGTGGCTGAGCGGGAGGTGCCGCTCTATGTG 
AACGGGCCCCCCATCATCTCCAGTGAGGCAGTGCAGTATGCTGTGAGGGGTGACGGTG 
GCAAGGTGGAGTGTTTCATTGGGAGCACACCACCCCCAGACCGCATAGCATGGGCCTG 
GAAGGAGAACTTCTTGGAGGTGGGGACCCTGGAACGCTATACAGTGGAGAGGACCAAC 
TCAGGCAGTGGGGTGCTATCCACGCTCACCATCAACAATGTCATGGAGGCCGACTTTC 
AGACTCACTACAACTGCACCGCCTGGAACAGCTTCGGGCCAGGCACAGCCATCATCCA 
GCTGGAAGAGCGAGAGGTGTTACCTGTGGGCATCATAGCTGGGGCCACCATCGGCGCX3 
AGCATCCTGCTCATCTTCTTCTTCATCGCCTTGGTATTCTTCCTCTACCGGCGCCGCA 
AAGGCAGTCGCAAAGACGTGACCCTGAGGAAGCTGGATATCAAGGTGGAGACAGTGAA 
CCGAGAGCCACTTACGATGCATTCTGACCGGGAGGATGACACCGCCAGCGTCTCCACA 
GCAACCCGGGTCATGAAGGCCATCTACTCGTCGTTTAAGGATGATGTGGATCTGAAGC 
AGGACCTGCGCTGCGACACCATCGACACCCGGGAGGAGTATGAGATGAAGGACCCCAC 
CAATGGCTACTACAACGTGCGTGCCCATGAAGACCGCCCGTCTTCCAGGGCAGTGCTC 
TATGCTGACTACCGTGCCCCTGGCCCTGCCCGCTTCGACGGCCGCCCCTCATCCCGTC 
TCTCCCACTCCAGCGGCTATGCCCAGCTCAACACCTATAGCCGGGGCCCTGCCTCTGA 
CTATGGCCCTGAGCCCACACCCCCTGGCCCTGCTGCCCCAGCTGGCACTGACACAACC 
AGCCAGCTGTCCTACGAGAACTATGAGAAGTTCAACTCCCATCCCTTCCCTGGGGCAG 
CTGGGTACCCCACCTACCGACTGGGCTACCCCCAGGCCCCACCCTCTGGCCTGGAGCG 
GACCCCATATGAGGCGTATGACCCCATTGGCAAGTACGCCACAGCCACTCGATTCTCC 
TACACCTCCCAGCACTCGGACTACGGCC^GCGATTCCAGCAGCGCATGCAGACTCACG 
TGTAGGGGCCAGAGCCTGGCTGGGGCATCTCTGCGGGGCAGAGGAGAAGGCTTTCGCA 



GCTGTTCCCn'GATATTCAGGGACATTGCrCATTGCTCCCTTCTCGGACCAGCC^ 



CCTCCCACCATGGCAGGTGGGGAGCAGGTCTCCCAGAGACACCCCGTCCCGAGGATGG 



TGCTCTGTGCATGCCCCAGCCTCCTGGGCCTGCCCTTCCCTCTTC1TCGGGAGGATGT 



GTCTCTTCTGACCnXjCACTCnTGCCTGACCCTAGAATGGGGACAGGGAAAGTGAAGGT 



TAGGGAAAGCAGAGGGGGGCACTTTTTAGCATTCCCTTTCTATCCCACCCCTCTGATC 



TCCCATAAGTGGAAATGGGGGTACCCAGGGATGGGCAGGCTTTGGCCTAGGGACATGA 



AGTATGGGAGTGGGTGGCTGTGGCACAGACAGGTGGAAAACGGGATAGCCTGGCCAGT 



CCCTCTGTTGTCTGCATTCGTGCCCTGGGTGCCTCTCTCCTTCCTCAGGGTACTGCAG 



AAGGGAGCGAAC AGGGTACTGTTCGCTCTTGT CT ACAGAACAGCCCTGGCACTG C ATT 



CAAATCCAGTCTTCATTCAGCTGGGATCAAAATGCCAGTCACCTTGGCTACCCACIX^ 



GGACAGCTGTCTGTCAGCATGCAGAGGGATCCAGGAATCCCCCCGGCAGCACGGCCCG 



CTTTCCTTCTCCTCCATGCTGGGCCAGCCAGATAAGTCAGGGTCCTGGTGGAGAAAGA 



AAGGCTAGGACCATGTCCTCATTGACCCAGATACTGCTGTGTGCTGCACAGCAGTGAA 



CCAACACTAGAGGGAGCCACACAAGCCTCCTCTCCCCAGTCTGCCCCACTTCCTGGCT 



TTAACTCTTGAGCTGGTTTGGGGAGTGGTGAGGTAGGGGTGGGGGTGCTGTAGGCT 
T TTTCAAAAAAAAAC 
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ORF Start: ATG at 351 jORF Stop: TAG at 2439 




SEQ ID NO: 14 (696 aa |MW at 76928.3kD 


NOV2c, 
CG51373-03 
Protein Sequence 


MGQALKAWPRYRWGSADAGQYNLE I TDAELSDDAS YECQATEAALRS RRAKLTVL I P 
PEDTRIDGGPVILLQAGTPHNLTCRAFNAKPAATIIWFRDGTQQEGAVASTELLKDGK 
RETTVSQLLINPTDLDIGRVFTCRSMNEAIPSGKETSIELDVHHPPTVTLSIEPQTVQ 
EGERWFTCQATANPE I LGYRWAKGGFL I EDAHE SRYETNVDYS FFTE P VSCE VHNKV 
GSTNVSTLVNVHFAPRIVVDPKPTTTDIGSDVTLTCVWVGNPPLTLTWTKKDSNMVLS 
NSNQLLLKSVTQADAGTYTCRAIVPRIGVAEREVPLYVNGPPIISSEAVQYAVRGDGG 
KVECFIGSTPPPDRIAWAWKENFLEVGTLERYTVERTOSGSGVLSTLTINNVMEADFQ 
TH YNCTAWNS FGPGTAI IQLEEREVLPVGI I AGAT I GAS ILL I FFFIALVFFLYRRRK 
GSRKDVTLRKLDIKVETVNREPLTMHSDREDDTASVSTATRVMKAIYSSFKDDVDLKQ 
DLRCDTIDTREEYEMKDPTNGYYNVRAHEDRPSSRAVLYADYRAPGPARFDGRPSSRL 
SHSSGYAQLNTYSRGPASDYGPEPTPPGPAAPAGTDTTSQLSYENYEKFNSHPFPGAA 
GY PTYRLG Y PQAP P SGLERT P Y EAYD P I GKYATATR F S YT SQH S DYGQRFQQRMQTH V 




SEQ ID NO: 15 


1407 bp { 


NOV2d, 
CG51373-07 
DNA Sequence 


CGCTTCAGCCAGGAGCCAGCTGACCAGACGGTGGTGGCTGGACAGCGGGCCGTGCTCC 

CCTGTGTGCTGCTCAACTACTCTGGAATTGTGCAATGGACCAAGGACGGGCTGGCCCT 

GGGCATGGGCCAGGCCCTCAAAGCCTGGCCACGGTACCGGGTTGTGGGCTCCGCAGAC 

GCTGGGCAGTACAACCTGGAGATCACAGATGCTGAGCTCTCTGACGACGCCTCTTACG 

AGTGCCAGGCCACGGAGGCCGCCCTGCGCTCTCGGCGGGCGAAACTCACCGTGCTCAT 

CCCCCCAGAGGACACCAGGATTGACGGAGGCCCTGTGATTCTACTGCAGGCAGGCACC 

CCCCACAACCTCACATGCCGGGCCTTCAATGCGAAGCCTGCTGCCACCATCATCTGGT 

TCCGGGACGGGACGCAGCAGGAGGGCGCTGTGGCCAGCACGGAATTGCTGAAGGATGG 

GAAGAGGGAGACCACCGTGAGCCAACTGCTTATTAACCCCACGGACCTGGACATAGGG 

CGTGTCTTCACTTGCCGAAGCATGAACGAAGCCATCCCTAGTGGCAAGGAGACTTCCA 

TCGIAGCTGGATGTGCACCACCCTCCTACAGTGACCCTGTCCATTGAGCCACAGACGGT 

GCAGGAGGGTGAGCGTGTTGTCTTTACCTGCCAGGCCACAGCCAACCCCGAGATCTTG 

GGCTACAGGTGGGCCAAAGGGGGTTTCTTGATTGAAGACGCCCACGAGAGTCGCTATG 

AGACAAATGTGGATTATTCCTTTTTCACGGAGCCTGTGTCTTGTGAGGTTCACAACAA 

AGTGGGAAGCACCAATGTCAGCACTTTAGTAAATGTCCACTTTGCTCCCCGGATTGTA 

GTTGACCCCAAACCCACAACCACAGACATTGGCTCTGATGTGACCCTTACCTGTGTCT 

GGGTTGGGAATCCCCCCCTCACTCTCACCTGGACCAAAAAGGACT 

GAGTAACAGCAACCAGCTGCTGCTGAAGTCGGTGACTCAGGCAGACGCTGGCACCTAC 

ACCTGCCGGGCCATCGTGCCTCGAATCGGAGTGGCTGAGCGGGAGGTGCCGCTCTATG 

TGAACGGGCCCCCCATCATCTCCAGTGAGGCAGTGCAGTATGCTGTGAGGGGTGACGG 

TGGCAAGGTGGAGTGTTTCATTGGGAGCACACCACCCCCAGACCGCATAGCATGGGCC 

TGGAAGGAGAACTTCTTGGAGGTGGGGACCCTGGAACGCTATACAGTGGAGAGGACCA 

ACTCAGGC^GTGGGGTGCTATCCACGCTCACCATCAACAATGTCATGGAGGCCGACTT 

TCAGACTCACTACAACTGCACCGCCTGGAACAGCTTCGGGCCAGGCACAGCCATCATC 

CAGCTGGAAGAGCGA 




ORF Start: at 1 


ORF Stop: end of sequence 




SEQ ID NO: 1 6 469 aa MW at 5 1 246.2kD 


NOV2d, 
CG51373-07 
Protein Sequence 


RFSQEPADQTWAGQRAVLPCVLLNYSGIVQOTKTC 

AGQYNLEITDAELSDDASYECQATEAALRSRRAKLTVLIPPEDTRIDGGPVILLQAGT 
PHNLTCRAFNAKPAAT I IWFRDGTQQEGAVASTELLKDGKRETTVSQLIj INPTDLD I G 
RVFTCRSMNEAIPSGKETSIELDVHHPPTVTLSIEPQTVQEGERVVFTCQATANPEIL 
GYRWAKGGFLIEDAHESRYETNVDYSFFTEPVSCEVHNKVGSTNVSTLVNVHFAPRIV 
VDPKPTTTD IGSDVTLTCVWVGNP PLTLTOTKKDSNMVLSNSNQLLLKSVTQADAGTY 
TCRAI VPRIGVAEREVPLYVNGPPI I SSEAVQYAVRGDGGKVECFIGSTPPPDRI AWA 
WKENFLEVGTLERYTVERTNSGSGVLSTLTINNVMEADFQTHYNCTAWNSFGPGTAII 
QLEER 




SEQ ID NO: 17 |3430bp 


NOV2e, 


CTCTCCGATACTTTCTCCCAAGGGTCAGCTGCTTCTTCATTCCAAGTGGACAA 
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CG51373-10 
DNA Sequence 



CAGCTGCTCACTGTCCTTGAGAGACTTCAGCGAGAGACCAGGGTGTCCAGGCTCCATG 

CAGGAAAGCCATGCGTATAAATTCCACCTCTGAGCCAGGCCTCACCAGCAAGCCCACT 

CTTAAGCCCTTGACITGGGCTCCAGGGGCCATGGGAAGGAGAAACGGACCCAGACCCG 

CTTCAGCCAGGAGCCAGCTGACCAGACGGTGGTGGCTGGACAGCGGGCCGTGCTCCCC 

TGTGTGCTGCTCAACrrACTCTGGAATTGTGCAATGGACCAAGGACGGGCTGGCCCTGG 

GCATGGGCCAGGCCCTCAAAGCCTGGCCACGGTACCGGGTTGTGGGCTCCGCAGACGC 

TGGGCAGTACAACCTGGAGATCACAGATGCTGAGCTCTCTGACGACGCCTCTTACGAG 

TGCCAGGCCACGGAGGCCGCCCTGCGCTCTCGGCGGGCCAAACTCACCGTGCTCATCC 

CCCCAGAGGACACCAGGATTGACGGAGGCCCTGTGATTCTACTGCAGGCAGGCACCCC 

CCACAACCTCACATGCCGGGCCTTCAATGCGAAGCCTGCTGCCACCATCATCTGGTTC 

CGGGACGGGACGCAGCAGGAGGGCGCTGTGGCCAGCACGGAATTGCTGAAGGATGGGA 

AGAGGGAGACCACCGTGAGCCAACTGCTTATTAACCCCACGGACCTGGACATAGGGCG 

TGTCTTCACTTGCCGAAGCATGAACGAAGCCATCCCTAGTGGCAAGGAGACTTCCATC 

GAGCTGGATGTGCACCACCCTCCTACAGTGACCCTGTCCATTGAGCCACAGACGGTGC 

AGGAGGGTGAGCGTGTTGTCnTTACCTGCCAGGCCACAGCCAACCCCGAGATCI'TGGG 

CTACAGGTGGGCCAAAGGGGGTTTCTTGATTGAAGACGCCCACGAGAGTCGCTATGAG 

ACAAATGTGGATTATTCCTTTTTCACGGAGCCTGTGTCTTGTGAGGTTCACAACAAAG 

TGGGAAGCACCAATGTCAGCACITTAGTAAATGTCCACTTTGCTCCCCGGATTGTAGT 

TGACCCCAAACCCACAACCACAGACATTGGCTCTGATGTGACCCTTACCTGTGTCTGG 

GTTGGGGAAATCCCCCCCTCACTCTCACCTGGACCAAAAAGGACTCAAATATTGGGGC 

CCTGGCTTCTTGGTTCCCCACCCGAGGCTGCTCTCTCTGCCCAGGTCCTGAGTAACAG 

CAACCAGCTGCTGCTGAAGTCGGTGACTCAGGCAGACGCTGGCACCTACACCTGCCGG 

GCCATCGTGCCTCGAATCGGAGTGGCTGAGCGGGAGGTGCCGCTCTATGTGAACGGGC 

CCCCCATCATCTCCAGTGAGGCAGTGCAGTATGCTGTGAGGGGTGACGGTGGCAAGGT 

GGAGTGTTTCATTGGGAGCACACCACCCCCAGACCGCATAGCATGGGCCTGGAAGGAG 

AACTTCTTGGAGGTGGGGACCCTGGAACGCTATACAGTGGAGAGGACCAACTCAGGCA 

GTGGGGTGCTATCCACGCTCACCATCAACAATGTCATGGAGGCCGACTTTCAGACTCA 

CTACAACTGCACCGCCTGGAACAGCTTCGGGCCAGGCACAGCCATCATCCAGCTGGAA 

GAGCGAGAGGTGTTACCTGTGGGCATCATAGCTGGGGCCACCATCGGCGCGAGCATCC 

TGCTCATCTTCTTCTTCATCGCCTTGGTATTCTTCCTCTACCGGCGCCGCAAAGGCAG 

TCGCAAAGACGTGACCCTGAGGAAGCTGGATATCAAGGTGGAGACAGTGAACCGAGAG 

CCACTTACGATGCATTCTGACCGGGAGGATGACACCGCCAGCGTCTCCACAGCAACCC 

GGGTCATGAAGGCCATCTACTCGTCGTTTAAGGATGATGTGGATCTGAAGCAGGACCT 

GCGCTGCGACACCATCGACACCCGGGAGGAGTATGAGATGAAGGACCCCACCAATGGC 

TACTACAACGTGCGTGCCCATGAAGACCGCCCGTCTTCCAGGGCAGTGCTCTATGCTG 

ACTACCGTGCCCCTGGCCCTGCCCGCTTCGACGGCCGCCCCTCATCCCGTCTCTCCCA 

CTCCAGCGGCTATGCCCAGCTCAACACCTATAGCCGGGGCCCTGCCTCTGACTATGGC 

CCTGAGCCCACACCCCCTGGCCCTGCTGCCCCAGCTGGCACTGACACAACCAGCCAGC 

TGTCCTACGAGAACTATGAGAAGTTCAACTCCCATCCCTTCCCTGGGGCAGCTGGGTA 

CCCCACCTACCGACTGGGCTACCCCCAGGCCCCACCCTCTGGCCTGGAGCGGACCCCA 

TATGAGGCGTATGACCCCATTGGCAAGTACGCCACAGCCACTCGATTCTCCTACACCT 

CCCAGCACTCGGACTACGGCCAGCGATTCCAGCAGCGCATGCAGACTCACGTGTAGGG 

GCCAGAGCCTGGCTGGGGCATCTCTGCGGGGCAGAGGAGAAGGCTTTCGCAGCT 

CCTGATATTCAGGGACATTGCTCATTGCTCCCTTCTCGGACCAGCCTTCTTCCTCCCA 

CCATGGCAGGTGGGGAGCAGGTCTCCCAGAGACACCCCGTCCCGAGGATGGTGCTCTG 

TGCATGCCCCAGCCTCCTGGGCCTGCCCTTCCCTCTTCTTCGGGAGGATGTCTCTCTT 

CTGACCTGCACTCTTGCCTGACCCTAGAATGGGGACAGGGAAAGTGAAGGTTAGGGAA 

AGCAGAGGGGGGCACTTTTTAGCATTCCCITTCTATCCCACCCCTCTGAT 

AGTGGAAATGGGGGTACCCAGGGATGGGCAGGCTTTGGCCTAGGGACATGAAGTATGG 

GAGTGGGTGGCTGTGGCACAGACAGGTGGAAAACGGGATAGCCTGGCCAGTCCCTCTG 

TTGTCTGCATTCGTGCCCTGGGTGCCTCTCTCCTTCCTCAGGGTACTGCAGAAGGGAG 

CGAACAGGGTACTGTTCGCTCTTGTCTACAGAACAGCCCTGGCACTGCATTCAAATCC 

AGTCITCATTCAGCTGGGATCAAAATGCCAGTCACCTTGGCTACC^ 

TGTCTGTCAGCATGCAGAGGGATCCAGGAATCCCCCCGGCAGCACGGCCCGCTTTCCT 

TCTCCTCCATGCTGGGC CAGCCAGATAAGTCAGGGT CCTGGTGGAGAAAGAAAGGCT A 

GGACCATGTCCTCATTGACCCAGATACTGCTGTGTGCTGCACAGCAGT 

TAGAGGGAGCCACACAAGCCTCCTCTCCCCAGTCTGCCCCACTTCCTGGCri^rAACTC 
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TTGAGCTGGTTTGGGGAGTGGTGAGGTAGGGGTGGGGGTGCTGTAGGCTCTTTTTCAA 




AAAAAAAC 




ORF Start: ATG at 351 


I 




|ORF Stop: TAG at 2490 




SEQIDNO: 18 


713 aa 


MW at 78491 .OkD 


NOV2e 
CG51373-10 
Protein Sequence 


MGQALKAWPRYR VVGSADAGQYNLE ITDAELSDDAS YECQATEAALRS RRAKLTVL I P 
PEDTRIIXSGPVIIAjQAGTPHNLTCRAF^AKPAATIIWFRDGTQQEGAVASTELLKDGK 
RETTVSQLLINPTDLDIGRVPTCRSMNEAIPSGKETSIELDVHHPPTVTLSIEPQTVQ 
EGERWFTCQATANPEILGYRWAKGGFLIEDAHESRYETNVDYSFFTEPVSCEVHNKV 
GSTNVSTLVNVHFAPRIVVDPKPTTTDIGSDVTLTCVWVGEIPPSLSPGPKRTQILGP 
WLLGSPPEAALSAQVLSNSNQLLLKSVTQADAGTYTCRAIVPRIGVAEREVPLYVNGP 
PIISSEAVQYAVRGDGGKVECFIGSTPPPDRIAWAWKENFLEVGTLERYTVERTNSGS 
GVLSTLT INNVMEADFQTHYNCTAWNSFGPGTAI IQLEEREVLP VGI I AGATI GAS I L 
LI FFFI ALVFFLYRRRKGSRKDVTLRKLDI KVBTVNREPLTMHSDREDDTASVSTATR 
VMKAIYSSFKDDVDLKQDLRCDTIDTREEYEMKDPTNGYYNVRAHEDRPSSRAVLYAD 
YRAPGPARFDGRPSSRLSHSSGYAQLNTYSRGPASDYGPEPTPPGPAAPAGTDTTSQL 
SYENYEKFNSHPF PGAAGYPTYRLGYPQAPPSGLERTPYEAYDP IGKYATATRFSYTS 
QHSDYGQRFQQRMQTHV 




SEQIDNO: 19 


|3212bp 






NOV2f, 
CG5 1373-11 
DNA Sequence 


ATGCTGAGCCTCCTCGTCTGGATCCTCACTCTCTCCGATACTTTCTCCCAAGGGACCC 
AGACCCGCTTCAGCCAGGAGCCAGCTGACCAGACGGTGGTGGCTGGACAGCGGGCCGT 
GCTCCCCTGTGTGCTGCTCAACTACTCTGGAATTGTGCAATGGACCAAGGACGGGCTG 
GCCCTGGGCATGGGCCAGGGCCTCAAAGCCTGGCCACGGTACCGGGTTGTGGGCTCCG 
CAGACGCTGGGCAGTACAACCTGGAGATCACAGATGCTGAGCTCTCTGACGACGCCTC 
TTACGAGTGCCAGGCCACGGAGGCCGCCCTGCGCTCTCGGCGGGCCAAACTCACCGTG 
CT CATC C C CCC AGAGG ACACCAGG ATTGACGGAGGCCCTGTGATT CTACTGCAGGCAG 
GCACCCCCCACAACCTCACATGCCGGGCCTTCAATGCGAAGCCTGCTGCCACCATCAT 
CTGGTTCCGGGACGGGACGCAGCAGGAGGGCGCTGTGGCCAGCACGGAATTGCTGAAG 
GATGGGAAGAGGGAGACCACCGTGAGCCAACTGCTTATTAACCCCACGGACCTGGACA 
TAGGGCGTGTCTTCACTTGCCGAAGCATGAACGAAGCCATCCCTAGTGGCAAGGAGAC 
TTCCATCGAGCTGGATGTGCACCACCCTCCTACAGTGACCCTGTCCATTGAGCCACAG 
ACGGTGCAGGAGGGTGAGCGTGTTGTCTTTACCTGCCAGGCCACAGCCAACCCCGAGA 
TCTTGGGCTACAGGTGGGCCAAAGGGGGTTTCTTGATTGAAGACX5CCCACGAGAGTCG 
CTATGAGACAAATGTGGATTATTCCTTTTTCACGGAGCCTGTGTCTTGTGAGGTTCAC 
AACAAAGTGGGAAGCACCAATGTCAGCACTTTAGTAAATGTCCACTTTGCTCCCCGGA 
TTGTAGTTGACCCCAAACCCACAACC^CAGACATTGGCTCTGATGTGACCCTTACCTG 
TGTCTGGGTTGGGAATCCCCCCCTCACTCTCACCTGGACCAAAAAGGACTCAAATATG 
GTCCTGAGTAACAGCAACCAGCTGCTGCTGAAGTCGGTGACTCAGGCAGACGCTGGCA 
CCTACACCTGCCGGGCCATCGTGCCTCGAATCGGAGTGGCTGAGCGGGAGGTGCCGCT 
CTATGTGAACGGGCCCCCCATCATCTCCAGTGAGGCAGTGCAGTATGCTGTGAGGGGT 
GACGGTGGCAAGGTGGAGTGTTTCATTGGGAGCACACCACCCCCAGACCGCATAGCAT 
GGGCCTGGAAGGAGAACTTCTTGGAGGTGGGGACCCTGGAACGCTATACAGTGGAGAG 
GACCAACTCAGGCAGTGGGGTGCTATCCACGCTCACCATCAACAATGTCATGGAGGCC 
GACTTT(^GACTCACTACAACTGCACCGCCTGGAACAGCTTCGGGCCAGGCACAGCCA 
TCATCCAGCTGGAAGAGCGAGAGGTGTTACCTGTGGGCATCATAGCTGGGGCCACCAT 
CGGCGCGAGCATCCTGCTCATCTTCTTCTTCATCGCCTTGCT 

CGCCGCAAAGGCAGTCGCAAAGACGTGACCCTGAGGAAGCTGGATATCAAGGTGGAGA 
CAGTGAACCGAGAGCCACTTACGATGCATTCTGACCGGGAGGATGACACCGCCAGCGT 
CTCCACAGCAACCCGGGTCATGAAGGCCATCTACTCGTCGTTTAAGGATGATGTGGAT 
CTGAAGCAGGACCTGCGCTGCGACACCATCGACACCCGGGAGGAGTATGAGATGAAGG 
ACCCCACCAATGGCTACTACAACGTGCGTGCCCATGAAGACCGCCCGTCTTCCAGGGC 
AGTGCTCTATGCTGACTACCGTGCCCCTGGCCCTGCCCGCTTCGACGGCCGCCCCTCA 
TCCCGTCTCTCCCACTCCAGCGGCTATGCCCAGCTCAACACCTATAGCCGGGGCCCTG 
CCTCTGACTATGGCCCrGAGCCCACACCCCCTXXXrCCT^ 

CACAACCAGCCAGCTGTCCTACGAGAACTATGAGAAGTTCAACTCCCATCCCTTCCCT 
GGGGCAGCTGGGTACCCCACCTACCGACTGGGCTACCCCCAGGCCCCACCCTCTGGCC 
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TGGAGCGGACCCCATATGAGGCGTATGACCCCATTGGCAAGTACGCCACAGCCACTCG 
ATTCTCCTACACCTCCCAGCACTCGGACTACGGCCAGCGATTCCAGCAGCGCATGCAG 
ACTCACGTGTAGGGGCCAGAGCCTGGCTGGGGCATCTCTGCGGGGCAGAGGAGAAGGC 


TTTCGCAGCTGTTCCCTGATATTCAGGGACATTGCTCATTGCTCCCTTCTCGGACCAG 




CCT T CTT C CT C C C AC CATGGCAGGTGGGGAGCAGGT CTC C CAG AGACAC CC CGT C C CG 




AGGATGGTGCTCTGTGCATGCCCCAGCCTCCTGGGCCTGCCCTTCCCTCITCTTCGGG 




AGG ATGTGT CTC TTCTG AC CTG CACT CTTG CCTG AC C CT AG AATGGGG ACAGGG AAAG 


TGAAGGTT AGGGAAAGCAG AGGGGGGCACT'lT'lTAG CATTCCCTTTCT ATC C CACCCC 


TCTGATCTCCCATAAGTGGAAATGGGGGTACCCAGGGATGGGCAGGCTTTGGCCTAGG 


GACATGAAGTATGGGAGTGGGTGGCTGTGGCACAGACAGGTGGAAAACGGGATAGCCT 


GGCCAGTCCCTCTGTTGTCTGCATTCGTGCCCTGGGTGCCTCTCTCCTTCCTCAGGGT 


ACTGCAGAAGGGAGCGAACAGGGTACTGTTCGCTCTTGTCTACAGAACAGCCCTGGCA 


CTGC^TTCAAATCCAGTCITCATTCAGCTGGGATCAAAATGCCAGTCACCTTGGCTAC 


CCACTGTGGACAGCTGTCTGTCAGCATGCAGAGGGATCCAGGAATCCCCCCGGCAGCA 


CGGCCCGCTTTCCTTCTCCTCCATGCTGGGCCAGCCAGATAAGTCAGGGTCCTGGTGG 


AGAAAGAAAGGCTAGGACGATGTCCTCATTGACCCAGATACTGCTGTGTGCTGCACAG 


CAGTGAACGAACACTAGAGGGAGCGACACAAGCCTCCTCTCCCCAGTCTGCCCCACTT 


CCTGGCTTTAACTCTTGAGCTGGTTTGGGGAGTGGTGAGGTAGGGGTGGGGGTGCTGT 


AGGCTCTTTTTCAAAAAAAAAC 




ORF Start: ATG at 1 j ORF Stop: TAG at 2272 




SEQ ID NO: 20 |757 aa |mW at 83534.8WD 


NOV2f, 
CG51373-11 
Protein Sequence 


MLSLLVWILTLSDTFSQGTQTRFSQEPADQTWAGQRAVLPCVLLNYSGIVQWTKTC 

ALGMGOGIjKAWPRYRWGSADAGOYNLEITDAELSDDASYECOATEAALRSRRAKLTV 

LI PPEDTRIDGGPVILLQAGTPHNLTCRAFNAKPAATI I WFRDGTQQEGAVASTELLK 

DGKRETTVSQLLINPTDLDIGRVFTCRSMNEAIPSGKETSIELDVHHPPTVTLSIEPQ 

TVQEGERWFTCQATANPEILGYRWAKGGFLIEDAHESRYETNVDYSFFTEPVSCEVH 

NKVGSTNVSTLVNVHFAPRIVVDPKPTTTDIGSDVTLTCVWVGNPPLTLTWTKKDSNM 

VLSNSNQLLLKSVTQADAGTYTCRAIVPRIGVAEREVPLYVNGPPIISSEAVQYAVRG 

DGGKVECFIGSTPPPDRIAWAWKENFLEVGTLERYTVERTNSGSGVLSTLTINNVMEA 

DFQTHYNCTAWNSFGPGTAI IQLEEREVLPVGI IAGATIGASILLIFFFI ALVFFLYR 

RRKGSRKDVTLRKLDIKVETVNREPLTMHSDREDDTASVSTATRVMKAIYSSFKDDVD 

LKQDLRCDTIDTREEYEMKDPTNGYYNVRAHEDRPSSRAVLYADYRAPGPARFDGRPS 

SRLSHSSGYAQLNTYSRGPASDYGPEPTPPGPAAPAGTDTTSQLSYENYEKFNSHPFP 

GAAGYPTYRLGYPQAPPSGLERTPYEAYDPIGKYATATRFSYTSQHSDYGQRFQQRMQ 

THV 




SEQ ID NO: 21 |2290bp | 


NOV2g, 
CG51373-13 
DNA Sequence 


CACCAAGCTTATGCTGAGCCTCCTCGTCTGGATCCTCACTCTCTCCGATACTTTCTCC 

CAAGGGACCCAGACCCGCTTCAGCCAGGAGCCAGCTGACCAGACGGTGGTGGCTGGAC 

AGCX^CCGTGCTCCCCTGTGTGCTGCTCAACTACTCTGGAATTGTGCAATGGACC^ 

GGACGGGCTGGCCCTGGGCATGGGCCAGGGCCTCAAAGCCTGGCCACGGTACCGGGTT 

GTGGGCTCCGCAGACGCTGGGCAGTACAACCTGGAGATCACAGATGCTGAGCTCTCTG 

ACGACGCCTCTTACGAGTGCCAGGCCACGGAGGCCGCCCK3CGCTCTCGGCGGGCCAA 

ACTCACCGTGCTCATCCCCCCAGAGGACACCAGGATTGACGGAGGCCCTGTGATTCTA 

CTGCAGGCAGGCACCCCCCACAACCTCACATGCCGGGCCTTCAATOTGAAGCCTGCTG 

CCACCATCATCTGGTTCCGGGACGGGACGCAGCAGGAGGGCGCTGTGGCCAGCACGGA 

ATTGCTGAAGGATGGGAAGAGGGAGACGACCGTGAGCCAACTGCTT^ 

GACCTGGACATAGGGCGTGTCTTCACTTGCCGAAGCATGAACGAAGCCATCCCTAGTG 

GCAAGGAGACTTCCATCGAGCTCGATGTGCACCACCCTCCTACAGTGACCCTGT^ 

TGAGCCACAGACGGTGCAGGAGGGTGAGCGTGTTGTCTTTACCTGCCAGG^ 

AACCCGGAGATCTTGGGCTACAGGTGGGCCAAAGGGGGTTTCTTGATTGAAGACGCCC 

ACGAGAGTCGCTATGAGACAAATGTGGATTATTCCT TTTTCACGGAGCCTGTGTCTTG 

TGAGGT CCACAACAAAGTGGGAAGCACCAATGTCAG CACTTTAGT AAATGTCCACTTT 

GCTCCCCGGATTGTAGTTGACCCCAAACCCACAACCACAGACATTGGCTCTGATGTGA 

CCCTTACCTGTGTCTGGGTTGGGAATCCCCCCCTCACTCTCACCTGGACCAAAAAGGA 

CTCAAATATGGTCCTGAGTAACAGCAACCAGCTGCTGCrGAAGTCGGTGACT 
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GACGCTGGCACCTACACCTGCCGGGCCATCGTGCCTCGAATCGGAGTGGCTGAGCGGG 
AGGTGCCGCTCTATGTGAACGGGCCCCCCATCATCTCCAGTGAGGCAGTGCAGTATGC 
TGTGAGGGGTGACGGTGGCAAGGTGGAGTGTTTCATTGGGAGCACACCACCCCCAGAC 
CGCATAGCATGGGCCTGGAAGGAGAACTTCTTGGAGGTGGGGACCCTGGAACGCTATA 
CAGTGGAGAGGACCAACTCAGGCAGTGGGGTGCT AT C CACGCTCACCAT CAACAATGT 
C^TGGAGGCCGACTTTCAGACTCACTACAACTGCACCGCCTGGAACAGCTTCGGGCCA 
GGCACAGCCATCATCCAGCTGGAAGAGCGAGAGGTGTTACCTGTGGGCATCATAGCTG 
GGGCCACCATCGGCGCGAGCATCCTGCTCATCTTCTTCTTCATCGCCTTGGTATTCTT 
CCTCTACCGGCGCCGCAAAGGCAGTCGCAAAGACGTGACCCTGAGGAAGCTGGATATC 
AAGGTGGAGACAGTGAACCGAGAGCCACTTACGATGCATTCTGACCGGGAGGATGACA 
CCGCCAGCGTCTCCACAGCAACCCGGGTCATGAAGGCCATCTACTCGTCGTTTAAGGA 
TGATGTGGATCTGAAGCAGGACCTGCGCTGCGACACCATCGACACCCGGGAGGAGTAT 
GAGATGAAGGACCCCACCAATGGCTACTACAACGTGCGTGCCCATGAAGACCGCCCGT 
CTTCCAGGGCAGTGCTCTATGCTGACTACCGTGCCCCTGGCCCTGCCCGCTTCGACGG 
CCGCCCCTCATCCCGTCTCTCCavCTCCAGCGGCTATGCCCAGCTCAAC^CCTATAGC 
CGGGGCCCTGCCTCTGACTATGGCCCTGAGCCCACACCCCCTGGCCCTGCTGCCCCAG 
CTGGCACTGACACAACCAGCCAGCTGTCCTACGAGAACTATGAGAAGTTCAACTCCCA 
TCCCTTCCCTGGGGCAGCTGGGTACCCCACCTACCGACTGGGCTACCCCCAGGCCCCA 
CCCTCTGGCCTGGAGCGGACCCCATATGAGGCGTATGACCCCATTGGCAAGTACGCCA 
CAGCCACTCGATTCTCCTACACCTCCCAGCACTCGGACTACGGCCAGCGATTCCAGCA 
GCGCATGCAGACTCACGTGGTCGACGGC 




ORF Start: ATG at 11 


|ORF Stop: at 2282 




SEQIDNO:22 | 


757 aa |MW at 83534.8kD 


NOV2g, 
CG51373-13 
Protein Sequence 


MLSLLVWILTLSDTFSQGTQTRFSQEPADQTWAGQRAVLPCVLLNYSGIVQVfTKDGL 
ALGMGQGLKAWPRYRWGS ADAGQYNLE I TDAELSDDAS YECQATEAALRSRRAKLTV 
LI PPEDTRIDGGPVILLQAGTPHNLTCRAFNAKPAATI IWFRDGTQQEGAVASTELLK 
IX3KRETTVSQLLINPTDLDIGRVFTCRSMNEAIPSGKETSIELDVHHPPTVTLSIEPQ 
TVQEGERWFTCQATANPEILGYRWAKGGFLIEDAHESRYETNVDYSFFTEPVSCEVH 
NKVGSTNVSTLVNVHFAPRIVVDPKPTTTDIGSDVTLTaWGNPPLTLTWTKKDSNM 
VLSNSNQLLLKSVTQADAGTYTCRAIVPRIGVAEREVPLYVNGPPIISSEAVQYAVRG 
DGGKVECFIGSTPPPDRIAWAWKENFLEVGTLERYTVERTNSGSGVLSTLTINNVMEA 
DFQTHYNCTAWNSFGPGTAI IQLEERE VLPVGI I AGATIGASILLI FFFI ALVFFLYR 
RRKGSRKDVTLRKLDIKVETVNREPLTMHSDREDDTASVSTATRVMKAIYSSFKDDVD 
LKQDLRCDTIDTREEYEMKDPTNGYYNVRAHEDRPSSRAVLYADYRAPGPARFDGRPS 
SRLSHSSGYAQLNTYSRGPASDYGPEPTPPGPAAPAGTDTTSQLSYENYEKFNSHPFP 
GAAGYPTYRLGYPQAPPSGLERTPYEAYDPIGKYATATRFSYTSQHSDYGQRFQQRMQ 
THV 




SEQIDNO: 23 


|2290bp 




NOV2h, 
CG5 1373-14 


CACCAAGCTTCAAGGGACCCAGACCCGCTTCAGCCAGGAGCCAGCTGACCAGACGGTG 
GTGGCTGGACAGCGGGCCGTGCTCCCCTGTGTGCTGCTCAACTACTCTGGAATTGTGC 
AATGGACCAAGGACGGGCTGGCCCTGGGCATGGGCCAGGGCCTCAAAGCCTGGCCACG 
GTACCGGGTTGTGGGCTCCGCAGACGCTGGGCAGTACAACCTGGAGATCACAGATGCT 
GAGCTCTCTGACGACGCCTCTTACGAGTGCCAGGCCACGGAGGC03CCCTGCGCTCTC 
GGCGGGCCAAACTCACCGTGCTCATCCCCCCAGAGGACACCAGGATTGACGGAGGCCC 
TGTGATTCTACTGCAGGCAGGCACCCCCCACAACCTCACATGCCGGGCCTTCAATGCG 
AAGCCTGCTGCCACCATCATCTGGTTCCGGGACGGGACGCAGCAGGAGGGCGCTGTGG 
CCAGCACGGAATTGCTGAAGGATGGGAAGAGGGAGACCACCXn'GAGCCAACTGCTTAT 
TAACCCCACGGACCTGGACATAGGGCGTGTCTTCACTTGCCGAAGCATGAACGAAGCC 
ATCCCTAGTGGCAAGGAGACTTCCATCGAGCTGGATGTGCACCACCCTCCTACAGTGA 
CCCTGTCCATTGAGCCACAGACGGTGCAGGAGGGTGAGCGTGTTGTCTTTACCTGCCA 
GGCCACAGCCAACCCGGAGATCTTGGGCTACAGGTGGGCCAAAGGGGGTTTCTTGATT 
GAAGACGCCCACGAGAGTCGCTATGAGACAAATGTGGATTATTCCTTTTTCACGGAGC 
CTGTGTCTTGTG AGGTTCACAACAAAGTGGGAAG CACCAATGT CAGCACTTT AGT AAA 
TGTCCACTTTGCTCCCCGGATTGTAGTTGACCCCAAACCCACAACCACAGACATTGGC 
TCTGATGTGACCCTTACCTGTGTCTGGGTTGGGAATCCCCCCCTCACTCTCACCTGGA 
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CCAAAAAGGACTCAAATATGGGGCCCAGGCCTCCTGGCTCCCCACCCGAGGCTGCTCT 
CTCTGCCCAGGTCCTGAGTAACAGCAACCAGCTGCTGCTGAAGTCGGTGACTCAGGCA 
GACGCTGGCACCTACACCTGCCGGGCCATCGTGCCTCGAATCGGAGTGGCTGAGCX5GG 
AGGTGCCGCTCTATGTGAACGGGCCCCCCATCATCTCCAGTGAGGCAGTGCAGTATGC 
TGTGAGGGGTGACGGTGGCAAGGTGGAGTGTTTCATTGGGAGCACACCACCCCCAGAC 
CGCATAGCATGGGCCTGGAAGGAGAACTTCTTGGAGGTGGGGACCCTGGAACGCTATA 
CAGTGGAGAGGACC^CTC^GGCAGTGGGGTGCTATCCACGCTCACCATCAACAATGT 
CATGGAGGCCGACTTTCAGACTCACTACAACTGCACCGCCTGGAACAGCTTCGGGCCA 
GGCACAGCCATCATCCAGCTGGAAGAGCGAGAGGTGTTACCTGTGGGCATCATAGCTG 
GGGCCACCATCGGCGCGAGCATCCTGCTCATCTTCTTCTTCATCGCCTTGGTATTCTT 
CCTCTACCGGCGCCGCAAAGGCAGTCGCAAAGACGTGACCCTGAGGAAGCTGGATATC 
AAGGTGGAGACAGTGAACCGAGAGCCACTTACGATGCATTCTGACCGGGAGGATGACA 
CCGCCAGCGTCTCCACAGCAACCCGGGTCATGAAGGCCATCTACTCGTCGTTTAAGGA 
TGATGTGGATCTGAAGCAGGACCTGCGCTGCGACACCATCGACACCCGGGAGGAGTAT 
GAGATGAAGGACCCCACCAATGGCTACTACAACGTGCGTGCCCATGAAGACCGCCCGT 
CTTCC^GGGCAGTGCTCTATGCTGACTACCGTGCCCCTGGCCCTGCCCGCTTCGACGG 
CCGCCCOTCATCCCGTCTCTCCCACTCCAGCGGCTATGCCCAGCTCAACACCTATAGC 
CGGGGCCCTGCCTCTGACTATGGCCCTTS^^ 

CTGGCACTGACACAACCAGCCAGCTGTCCTACGAGAACTATGAGAAGTTCAACTCCCA 
TCCCTTCCCTGGGGCAGCTGGGTACCCCACCTACCGACTGGGCTACCCCCAGGCCCCA 
CCCTCTGGCCTGGAGCGGACCCCATATGAGGCGTATGACCCCATTGGCAAGTACGCCA 
CAGCCACTCGATTCTCCTACACCTCCCAGCACTCGGACTACGGCCAGCGATTCCAGCA 
GCGCATGCAGACTCACGTGGTCGACGGC 




ORF Start: at 1 1 | |ORF Stop: at 2282 




SEQIDNO: 24 


757 aa |MW at 83227.3kD 


NOV2h, 
CG51373-14 
Protein Sequence 


QGTQTRFSQEPATOTWAGQRAVLPCnrc^iNYSGIV^ 

VGSADAGQYNLE ITDAELSDDASYECQATEAALRSRRAKLTVLI PPEDTRIDGGPVIL 
LQAGTPHNLTCRAFNAKPAAT 1 I WFRDGTQQEGAVASTELLKDGKRETTVSQLL INPT 
DLDIGRVFTCRSMNEAIPSGKETSIELDVHHPPTVTLSIEPQTVQEGERVVFTCQATA 
NPEILGYRWAKGGFLIEDAHESRYETNVDYSFFTEPVSCEVHNKVGSTNVSTLVNVHF 
APRIVVDPKPTTTDIGSDVTLTCVWVGNPPLTLTWTKKDSNMGPRPPGSPPEAALSAQ 
VLSNSNQLLLKSVTQADAGTYTCRAI VPRIGVAEREVPLYVNGPPI I SSEAVQYAVRG 
DGGKVECFIGSTPPPDRIAWAWKENFLEVGTLERYTVERTNSGSGVLSTLTINNVMEA 
DFQTHYNCTAWNSFGPGTAI IQLEEREVLPVGI I AGATIGASILLI FFFIALVFFLYR 
RRKGSRKDVTLRKLDIKVETVNREPLTMHSDREDDTASVSTATRVMKAIYSSFKDDVD 
LKQDLRCDTIDTREEYEMKDPTNGYTfNVRAHEDRPSSRAVLYADYRAPGPARFDGRPS 
SRLSHSSGYAQLNTYSRGPASDYGPEPTPPGPAAPAGTDTTSQLSYENYEKFNSHPFP 
GAAGYPTYRLGYPQAPPSGLERTPYEAYDPIGKYATATRFSYTSQHSDYGQRFQQRMQ 
THV 



Sequence comparison of the above protein sequences yields the following 



sequence relationships shown in Table 2B. 



Table 2B. Comparison of NOV2a against NOV2b through NOV2h. 


Protein Sequence 


NOV2a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV2b 


17..773 
101..841 


739/757 (97%) 
740/757 (97%) 


NOV2c 


62..773 
1..696 


695/712(97%) 
695/712(97%) 
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NOV2d 


22..506 


468/485 (96%) 




1..469 


468/485 (96%) 


NOV2e 


62..773 


696/713 (97%) 




1..713 


698/713 (97%) 


NOV2f 


1..773 


757/773 (97%) 




1..757 


757/773 (97%) 


NOV2g 


1..773 


757/773 (97%) 




1..757 


757/773 (97%) 


NOV2h 


17..773 


757/757 (100%) 




1..757 


757/757 (100%) 



Further analysis of the NOV2a protein yielded the following properties shown in 
Table 2C. 



Table 2C. Protein Sequence Properties NOV2a 


SignalP analysis: 


Cleavage site between residues 17 and 18 


PSORTH 
analysis: 


PSG: a new signal peptide prediction method 

N-region: length 0; pos.chg 0; neg.chg 0 
H-region: length 12; peak value 10.45 
PSG score: 6.05 

GvH: von Heijne's method for signal seq. recognition 
GvH score (threshold: -2.1): 0.73 
possible cleavage site: between 16 and 17 

>>> Seems to have a cleavable signal peptide (1 to 16) 

ALOM: Klein et al ' s method for TM region allocation 
Init position for calculation: 17 

Tentative number of TMS(s) for the threshold 0.5: 

1 

Number of TMS(s) for threshold 0.5: 1 

INTEGRAL Likelihood =-12.26 Transmembrane 519 

* 535 

PERIPHERAL Likelihood = 3.61 (at 38) 
ALOM score: -12.26 (number of TMSs: 1) 

MTOP: Prediction of membrane topology (Hartmann et al . ) 
Center position for calculation: 8 
Charge difference: -2.0 C(-1.0) - N( 1.0) 
N >= C: N- terminal side is inside 

»> membrane topology: type la (cytoplasmic tail 536 to 
773) 

MITDISC: discrimination of mitochondrial targeting seq 
R content: 1 Hyd Moment (75) : 1.68 
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Hyd Moment (95): 2.60 G content: 1 
D/E content: 2 S/T content: 8 
Score: -5.22 




Gavel: prediction of cleavage sites for mitochondrial 
Dresea 

R-2 motif at 32 TRF|SQ 




NUCDISC: discrimination of nuclear localization signals 
pat4: RRRK (5) at 538 
pat 7 : none 
bipartite: none 

content of basic residues: 9.6% 
NLS Score: -0.16 




KDEL: ER retention motif in the C-terminus: none 




ER Membrane Retention Signals : none 




SKL: peroxisomal targeting signal in the C-terminus : none 




PTS2 : 2nd peroxisomal targeting signal: none 




VAC: possible vacuolar targeting motif: none 




RNA-binding motif: none 




Actinin-type actin-binding motif : 
type 1: none 
type 2 : none 




NMYR: N-myristoylation pattern : none 




Prenylation motif: none 




memYQRL: transport motif from cell surface to Golgi : none 




Tyrosines in the tail: too long tail 




Hi 1 ai im{ no mnf 1 f IT* t" Vl ^ fail. r\ OTl & • 
UllcUClUc UlUL.il. HI toll . 11UHC 




checking 63 PROSITE DNA binding motifs: none 




checking 71 PROSITE ribosomal protein motifs: none 




checking 33 PROSITE prokaryotic DNA binding motifs: none 




NNCN: Reinhardt's method for Cytoplasmic/Nuclear 
discrimination 

Prediction: cytoplasmic 

Reliability: 70.6 



36 



WO 03/080856 



PCT/US03/08490 



COIL: Lupas's algorithm to detect coiled-coil regions 
total: 0 residues 



Final Results (k = 9/23) : 

44.4 %: endoplasmic reticulum 

22.2 %: Golgi 

22.2 %: extracellular, including cell wall 

11.1 %: plasma membrane 

» prediction for CG51373-12 is end (k=9) 



A search of the NOV2a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 2D. 

5 



37 



WO 03/080856 



PCT/US03/08490 



Table 2D. Geneseq Results for NOV2a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV2a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB05749 


Human G protein-coupled receptor 
NOVla protein SEQ ID NO:2 - 
Homo sapiens, 841 aa. 
[WO200200691-A2, 03-JAN- 
2002] 


17..773 
101..841 


739/757 (97%) 
740/757 (97%) 


0.0 


ABP66884 


Human polypeptide SEQ ID NO 
605 - Homo sapiens, 749 aa. 
[US2002090672-A1, 11-JUL- 
2002] 


57..773 
33.-749 


717/717 (100%) 
717/717(100%) 


0.0 


ABB 10297 


Human cDNA SEQ ID NO: 605 - 
Homo sapiens, 749 aa. 
[WO200154474-A2, 02-AUG- 
2001] 


57..773 
33..749 


717/717(100%) 
717/717 (100%) 


0.0 


ABG65107 


Human albumin fusion protein 
#1782 - Homo sapiens, 712 aa. 
(WO200177137-A1, 18-OCT- 
2001] 


62..773 
1..712 


712/712 (100%) 
712/712(100%) 


0.0 


AAE07070 


Human gene 20 encoded secreted 
protein HDTJG33, SEQ ID NO:87 
- Homo sapiens, 712 aa. 
[WO200154708-A1, 02-AUG- 
2001] 


62..773 
1..712 


712/712(100%) 
712/712(100%) 


0.0 



In a BLAST search of public sequence databases, the NOV2a protein was found to 



have homology to the proteins shown in the BLASTP data in Table 2E. 



38 



WO 03/080856 



PCT/US03/08490 



Table 2E. Public BLASTP Results for NOV2a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV2a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAD23338 


Sequence 1 from Patent 
WO0200691 - Homo sapiens 
(Human), 841 aa. 


1 7 7*71 

101..841 


710/7^7 fQ70/.\ 

740/757 (97%) 


u.u 


Q8CJ59 


NEPH1 - Mus musculus 
(Mouse), 789 aa. 


18..773 
50..789 


714/756(94%) 
726/756 (95%) 


0.0 


CAD23349 


Sequence 25 from Patent 
WO0200691 - Homo sapiens 
(Human), 696 aa. 


62..773 
1..696 


695/712 (97%) 
695/712 (97%) 


0.0 


CAD23348 


Sequence 23 from Patent 
WO0200691 - Homo sapiens 
(Human), 713 aa. 


62. .773 
1..713 


696/713(97%) 
698/713(97%) 


0.0 


Q96J84 


NEPH1 - Homo sapiens 
(Human), 60S aa. 


1..607 
1..591 


590/607 (97%) 
591/607 (97%) 


0.0 



PFam analysis predicts that the NOV2a protein contains die domains shown in the 



Table 2F. 



Table 2F. Domain Analysis of NOV2a 


Pfam Domain 


NOV2a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ig 


35..102 


12/72(17%) 
47/72 (65%) 


1.3e-05 


<g 


136..202 


14/69 (20%) 
48/69 (70%) 


0.0023 


ig 


322..389 


21/71 (30%) 
55/71 (77%) 


7.2e-10 



5 NOVX nucleic acids and their encoded polypeptides are useful in a variety of 

applications and contexts. The various NOVX nucleic acids and polypeptides according 
to the invention are useful as novel members of the protein families according to the 
presence of domains and sequence relatedness to previously described proteins. 
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Additionally, NOVX nucleic acids and polypeptides can also be used to identify proteins 
that are members of the family to which the NOVX polypeptides belong. 

Consistent with other known members of the family of proteins, identified in 
column 5 of Table A, the NOVX polypeptides of the present invention show homology to, 
5 and contain domains that are characteristic of, other members of such protein families. 

The NOVX nucleic acids and polypeptides can also be used to screen for 
molecules, which inhibit or enhance NOVX activity or function. Specifically, the nucleic 
acids and polypeptides according to the invention can be used as targets for the 
identification of small molecules that modulate or inhibit diseases associated with the 
10 protein families listed in Table A. 

The NOVX nucleic acids and polypeptides are also useful for detecting specific 
cell types. Details of the expression analysis for each NOVX are presented in Example 7. 
Accordingly, the NOVX nucleic acids, polypeptides, antibodies and related compounds 
according to the invention have diagnostic and therapeutic applications in the detection of 
1 5 a variety of diseases with differential expression in normal vs. diseased tissues, e.g. 
detection of a variety of cancers. 

Additional utilities for NOVX nucleic acids and polypeptides according to the 
invention are disclosed herein. 

NOVX clones 

20 NOVX nucleic acids and their encoded polypeptides are useful in a variety of 

applications and contexts. The various NOVX nucleic acids and polypeptides according 
to the invention are useful as novel members of the protein families according to the 
presence of domains and sequence relatedness to previously described proteins. 
Additionally, NOVX nucleic acids and polypeptides can also be used to identify proteins 

25 that are members of the family to which the NOVX polypeptides belong. 

The NOVX genes and their corresponding encoded proteins are useful for 
preventing, treating or ameliorating medical conditions, e.g., by protein or gene therapy. 
Pathological conditions can be diagnosed by determining the amount of the new protein in 
a sample or by determining the presence of mutations in the new genes. Specific uses are 

30 described for each of the NOVX genes, based on the tissues in which they are most highly 
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expressed. Uses include developing products for the diagnosis or treatment of a variety of 
diseases and disorders. 

The NOVX nucleic acids and proteins of the invention are useful in potential 
diagnostic and therapeutic applications and as a research tool. These include serving as a 

5 specific or selective nucleic acid or protein diagnostic and/or prognostic marker, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a 
small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug 
targeting/cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene 

10 delivery/gene ablation), and (v) a composition promoting tissue regeneration in vitro and 
in vivo (vi) a biological defense weapon. 

In one specific embodiment, the invention includes an isolated polypeptide 
comprising an amino acid sequence selected from the group consisting of: (a) a mature 
form of the amino acid sequence selected from the group consisting of SEQ ID NO: 2n, 

15 wherein n is an integer between 1 and 12; (b) a variant of a mature form of the amino acid 
sequence selected from the group consisting of SEQ ID NO: 2n, wherein n is an integer 
between 1 and 12, wherein any amino acid in the mature form is changed to a different 
amino acid, provided that no more than 1 5% of the amino acid residues in the sequence of 
the mature form are so changed; (c) an amino acid sequence selected from the group 

20 consisting of SEQ ID NO: 2n, wherein n is an integer between 1 and 12; (d) a variant of 
the amino acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n 
is an integer between 1 and 12 wherein any amino acid specified in the chosen sequence is 
changed to a different amino acid, provided that no more than 15% of the amino acid 
residues in the sequence are so changed; and (e) a fragment of any of (a) through (d). 

25 In another specific embodiment, the invention includes an isolated nucleic acid 

molecule comprising a nucleic acid sequence encoding a polypeptide comprising an amino 
acid sequence selected from the group consisting of: (a) a mature form of the amino acid 
sequence given SEQ ID NO: 2n, wherein n is an integer between 1 and 12; (b) a variant of 
a mature form of the amino acid sequence selected from the group consisting of SEQ ID 

30 NO: 2n, wherein n is an integer between 1 and 1 2 wherein any amino acid in the mature 

form of the chosen sequence is changed to a different amino acid, provided that no more 

than 1 5% of the amino acid residues in the sequence of the mature form are so changed; 
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(c) the amino acid sequence selected from the group consisting of SEQ ID NO: 2n, 
wherein n is an integer between 1 and 12; (d) a variant of the amino acid sequence selected 
from the group consisting of SEQ ID NO: 2n, wherein n is an integer between 1 and 12, in 
which any amino acid specified in the chosen sequence is changed to a different amino 

5 acid, provided that no more than 1 5% of the amino acid residues in the sequence are so 
changed; (e) a nucleic acid fragment encoding at least a portion of a polypeptide 
comprising the amino acid sequence selected from the group consisting of SEQ ID NO: 
2n, wherein n is an integer between 1 and 12 or any variant of said polypeptide wherein 
any amino acid of the chosen sequence is changed to a different amino acid, provided that 

10 no more than 10% of the amino acid residues in the sequence are so changed; and (f) the 
complement of any of said nucleic acid molecules. 

In yet another specific embodiment, the invention includes an isolated nucleic acid 
molecule, wherein said nucleic acid molecule comprises a nucleotide sequence selected 
from the group consisting of: (a) the nucleotide sequence selected from the group 

15 consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 12; (b) a 

nucleotide sequence wherein one or more nucleotides in the nucleotide sequence selected 
from the group consisting of SEQ ID NO: 2n-l , wherein n is an integer between 1 and 12 
is changed from that selected from the group consisting of the chosen sequence to a 
different nucleotide provided that no more than 15% of the nucleotides are so changed; 

20 (c) a nucleic acid fragment of the sequence selected from the group consisting of SEQ ID 
NO: 2n-l, wherein n is an integer between 1 and 12; and (d) a nucleic acid fragment 
wherein one or more nucleotides in the nucleotide sequence selected from the group 
consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 12 is changed 
from that selected from the group consisting of the chosen sequence to a different 

25 nucleotide provided that no more than 15% of the nucleotides are so changed. 

NOVX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
NOVX polypeptides or biologically active portions thereof. Also included in the 
invention are nucleic acid fragments sufficient for use as hybridization probes to identify 
30 NOVX-encoding nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR 

primers for the amplification and/or mutation of NOVX nucleic acid molecules. As used 
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herein, the term "nucleic acid molecule" is intended to include DNA molecules (e.g., 
cDNA or genomic DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA 
generated using nucleotide analogs, and derivatives, fragments and homologs thereof. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 

5 comprised double-stranded DNA. 

A NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product 
of a naturally occurring polypeptide or precursor form or proprotein. The naturally 
occurring polypeptide, precursor or proprotein includes, by way of nonlimiting example, 

10 the full-length gene product encoded by the corresponding gene. Alternatively, it can be 
defined as the polypeptide, precursor or proprotein encoded by an ORF described herein. 
The product "mature" form arises, by way of nonlimiting example, as a result of one or 
more naturally occurring processing steps that may take place within the cell (e.g., host 
cell) in which the gene product arises. Examples of such processing steps leading to a 

1 5 "mature" form of a polypeptide or protein include the cleavage of the N-terminal 

methionine residue encoded by the initiation codon of an ORF, or the proteolytic cleavage 
of a signal peptide or leader sequence. Thus a mature form arising from a precursor 
polypeptide or protein that has residues 1 to N, where residue 1 is the N-terminal 
methionine, would have residues 2 through N remaining after removal of the N-terminal 

20 methionine. Alternatively, a mature form arising from a precursor polypeptide or protein 
having residues 1 to N, in which an N-terminal signal sequence from residue 1 to residue 
M is cleaved, would have the residues from residue M+l to residue N remaining. Further 
as used herein, a "mature" form of a polypeptide or protein may arise from a step of 
post-translational modification other than a proteolytic cleavage event. Such additional 

25 processes include, by way of non-limiting example, glycosylation, myristylation or 
phosphorylation. In general, a mature polypeptide or protein may result from the 
operation of only one of these processes, or a combination of any of them. 

The term "probe", as utilized herein, refers to nucleic acid sequences of variable 
length, preferably between at least about 10 nucleotides (nt), about 100 nt, or as many as 

30 approximately, e.g. , 6,000 nt, depending upon the specific use. Probes are used in the 

detection of identical, similar, or complementary nucleic acid sequences. Longer length 

probes are generally obtained from a natural or recombinant source, are highly specific, 
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and much slower to hybridize than shorter-length oligomer probes. Probes can be single- 
stranded or double-stranded and designed to have specificity in PCR, membrane-based 
hybridization technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as used herein, is a nucleic acid that is 

5 separated from other nucleic acid molecules which are present in the natural source of the 
nucleic acid. Preferably, an "isolated" nucleic acid is free of sequences which naturally 
flank the nucleic acid (Le. 9 sequences located at the 5'- and 3'-termini of the nucleic acid) 
in the genomic DNA of the organism from which the nucleic acid is derived. For 
example, in various embodiments, the isolated NOVX nucleic acid molecules can contain 

10 less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of the cell/tissue from which 
the nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). Moreover, an "isolated" 
nucleic acid molecule, such as a cDNA molecule, can be substantially free of other 
cellular material, or culture medium, or of chemical precursors or other chemicals. 

1 5 A nucleic acid molecule of the invention, e.g. , a nucleic acid molecule having the 

nucleotide sequence of SEQ ID NO:2n-l , wherein n is an integer between 1 and 12, or a 
complement of this nucleotide sequence, can be isolated using standard molecular biology 
techniques and the sequence information provided herein. Using all or a portion of the 
nucleic acid sequence of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, as a 

20 hybridization probe, NOVX molecules can be isolated using standard hybridization and 
cloning techniques (e.g., as described in Sambrook, et a/., (eds.), MOLECULAR CLONING: A 
Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989; and Ausubel, et a/., (eds.), CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY, John Wiley & Sons, New York, NY, 1993.) 

25 A nucleic acid of the invention can be amplified using cDNA, mRNA or 

alternatively, genomic DNA, as a template with appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can be 
cloned into an appropriate vector and characterized by DNA sequence analysis. 
Furthermore, oligonucleotides corresponding to NOVX nucleotide sequences can be 

30 prepared by standard synthetic techniques, e.g. , using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 

residues. A short oligonucleotide sequence can be based on, or designed from, a genomic 
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or cDNA sequence and is used to amplify, confirm, or reveal the presence of an identical, 
similar or complementary DNA or RNA in a particular cell or tissue. Oligonucleotides 
comprise a nucleic acid sequence having about 10 nt, 50 nt, or 100 nt in length, preferably 
about 15 nt to 30 nt in length. In one embodiment of the invention, an oligonucleotide 

5 comprising a nucleic acid molecule less than 100 nt in length would further comprise at 
least 6 contiguous nucleotides of SEQ ID NO:2w-l , wherein n is an integer between 1 and 
12, or a complement thereof. Oligonucleotides can be chemically synthesized and can 
also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention 

10 comprises a nucleic acid molecule that is a complement of the nucleotide sequence shown 
in SEQ ID NO:2/i-l, wherein n is an integer between 1 and 12, or a portion of this 
nucleotide sequence (e.g., a fragment that can be used as a probe or primer or a fragment 
encoding a biologically-active portion of a NOVX polypeptide). A nucleic acid molecule 
that is complementary to the nucleotide sequence of SEQ ID NO:2/i-l, wherein n is an 

15 integer between 1 and 12, is one that is sufficiently complementary to the nucleotide 
sequence of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, that it can 
hydrogen bond with few or no mismatches to the nucleotide sequence shown in SEQ ID 
NO:2/M , wherein n is an integer between 1 and 12, thereby forming a stable duplex. 

As used herein, the term "complementary 5 * refers to Watson-Crick or Hoogsteen 

20 base pairing between nucleotides units of a nucleic acid molecule, and the term "binding" 
means the physical or chemical interaction between two polypeptides or compounds or 
associated polypeptides or compounds or combinations thereof. Binding includes ionic, 
non-ionic, van der Waals, hydrophobic interactions, and the like. A physical interaction 
can be either direct or indirect. Indirect interactions may be through or due to the effects 

25 of another polypeptide or compound. Direct binding refers to interactions that do not take 
place through, or due to, the effect of another polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

A "fragment" provided herein is defined as a sequence of at least 6 (contiguous) 
nucleic acids or at least 4 (contiguous) amino acids, a length sufficient to allow for 

30 specific hybridization in the case of nucleic acids or for specific recognition of an epitope 
in the case of amino acids, and is at most some portion less than a full length sequence. 
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Fragments can be derived from any contiguous portion of a nucleic acid or amino acid 
sequence of choice. 

A full-length NOVX clone is identified as containing an ATG translation start 
codon and an in-frame stop codon. Any disclosed NOVX nucleotide sequence lacking an 

5 ATG start codon therefore encodes a truncated C-terminal fragment of the respective 

NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 5' 
direction of the disclosed sequence. Any disclosed NOVX nucleotide sequence lacking an 
in-frame stop codon similarly encodes a truncated N-terminal fragment of the respective 
NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 3' 

10 direction of the disclosed sequence. 

A "derivative" is a nucleic acid sequence or amino acid sequence formed from the 
native compounds either directly, by modification or partial substitution. An "analog" is a 
nucleic acid sequence or amino acid sequence that has a structure similar to, but not 
identical to, the native compound, eg. they differs from it in respect to certain components 

15 or side chains. Analogs can be synthetic or derived from a different evolutionary origin 
and may have a similar or opposite metabolic activity compared to wild type. A 
"homolog" is a nucleic acid sequence or amino acid sequence of a particular gene that is 
derived from different species. 

Derivatives and analogs can be full length or other than full length. Derivatives or 

20 analogs of the nucleic acids or proteins of the invention include, but are not limited to, 
molecules comprising regions that are substantially homologous to the nucleic acids or 
proteins of the invention, in various embodiments, by at least about 70%, 80%, or 95% 
identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid sequence 
of identical size or when compared to an aligned sequence in which the alignment is done 

25 by a computer homology program known in the art, or whose encoding nucleic acid is 
capable of hybridizing to the complement of a sequence encoding the proteins under 
stringent, moderately stringent, or low stringent conditions. See e.g. Ausubel, et al. 9 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 
1993, and below. 

30 A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 

variations thereof, refer to sequences characterized by a homology at the nucleotide level 

or amino acid level as discussed above. Homologous nucleotide sequences include those 

46 



WO 03/080856 



PCT/US03/08490 



sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in 
different tissues of the same organism as a result of, for example, alternative splicing of 
RNA. Alternatively, isoforms can be encoded by different genes. In the invention, 
homologous nucleotide sequences include nucleotide sequences encoding for a NOVX 

5 polypeptide of species other than humans, including, but not limited to: vertebrates, and 
thus can include, e.g. 9 frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. 
Homologous nucleotide sequences also include, but are not limited to, naturally occurring 
allelic variations and mutations of the nucleotide sequences set forth herein. A 
homologous nucleotide sequence does not, however, include the exact nucleotide 

10 sequence encoding human NOVX protein. Homologous nucleic acid sequences include 
those nucleic acid sequences that encode conservative amino acid substitutions (see 
below) in SEQ ID NO:2/M, wherein n is an integer between 1 and 12, as well as a 
polypeptide possessing NOVX biological activity. Various biological activities of the 
NOVX proteins are described below. 

1 5 A NOVX polypeptide is encoded by the open reading frame ("ORF") of a NOVX 

nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be 
translated into a polypeptide. A stretch of nucleic acids comprising an ORF is 
uninterrupted by a stop codon. An ORF that represents the coding sequence for a full 
protein begins with an ATG "start 11 codon and terminates with one of the three "stop" 

20 codons, namely, TAA, TAG, or TGA. For the purposes of this invention, an ORF can be 
any part of a coding sequence, with or without a start codon, a stop codon, or both. For an 
ORF to be considered as a good candidate for coding for a bona fide cellular protein, a 
minimum size requirement is often set, e.g., a stretch of DNA that would encode a protein 
of 50 amino acids or more. 

25 The nucleotide sequences determined from the cloning of the human NOVX genes 

allows for the generation of probes and primers designed for use in identifying and/or 
cloning NOVX homologues in other cell types, e.g. from other tissues, as well as NOVX 
homologues from other vertebrates. The probe/primer typically comprises substantially 
purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide 

30 sequence that hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 

200, 250, 300, 350 or 400 consecutive sense strand nucleotide sequence of SEQ ID 

NO:2n-l, wherein n is an integer between 1 and 12; or an anti-sense strand nucleotide 
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sequence of SEQ ID NO:2«-l, wherein n is an integer between 1 and 12; or of a naturally 
occurring mutant of SEQ ID NO:2/i«l, wherein n is an integer between 1 and 12. 

Probes based on the human NOVX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 

5 embodiments, the probe has a detectable label attached, e.g. the label can be a 

radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes 
can be used as a part of a diagnostic test kit for identifying cells or tissues which 
mis-express a NOVX protein, such as by measuring a level of a NOVX-encoding nucleic 
acid in a sample of cells from a subject e.g., detecting NOVX mRNA levels or 

1 0 determining whether a genomic NOVX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of a NOVX polypeptide** 
refers to polypeptides exhibiting activity similar, but not necessarily identical to, an 
activity of a polypeptide of the invention, including mature forms, as measured in a 
particular biological assay, with or without dose dependency. A nucleic acid fragment 

15 encoding a "biologically-active portion of NOVX" can be prepared by isolating a portion 
of SEQ ID NO:2/i-l, wherein n is an integer between 1 and 12, that encodes a polypeptide 
having a NOVX biological activity (the biological activities of the NOVX proteins are 
described below), expressing the encoded portion of NOVX protein (eg., by recombinant 
expression in vitro) and assessing the activity of the encoded portion of NOVX. 

20 NOVX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, due 
to degeneracy of the genetic code and thus encode the same NOVX proteins as that 
encoded by the nucleotide sequences of SEQ ID NO:2/i-l , wherein n is an integer between 
25 1 and 12. In another embodiment, an isolated nucleic acid molecule of the invention has a 
nucleotide sequence encoding a protein having an amino acid sequence of SEQ ID NO:2«, 
wherein n is an integer between 1 and 12. 

In addition to the human NOVX nucleotide sequences of SEQ ID NO:2/i-l , 
wherein n is an integer between 1 and 12, it will be appreciated by those skilled in the art 
30 that DNA sequence polymorphisms that lead to changes in the amino acid sequences of 
the NOVX polypeptides may exist within a population (eg., the human population). Such 
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genetic polymorphism in the NOVX genes may exist among individuals within a 
population due to natural allelic variation. As used herein, the terms "gene" and 
"recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
(ORF) encoding a NOVX protein, preferably a vertebrate NOVX protein. Such natural 

5 allelic variations can typically result in 1-5% variance in the nucleotide sequence of the 
NOVX genes. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in the NOVX polypeptides, which are the result of natural allelic variation 
and that do not alter the functional activity of the NOVX polypeptides, are intended to be 
within the scope of the invention. 

10 Moreover, nucleic acid molecules encoding NOVX proteins from other species, 

and thus that have a nucleotide sequence that differs from a human SEQ ID NO:2w-l , 
wherein n is an integer between 1 and 12, are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to natural allelic variants and 
homologues of the NOVX cDNAs of the invention can be isolated based on their 

15 homology to the human NOVX nucleic acids disclosed herein using the human cDNAs, or 
a portion thereof, as a hybridization probe according to standard hybridization techniques 
under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the 

20 nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:2n-l , wherein n 
is an integer between 1 and 12. In another embodiment, the nucleic acid is at least 10, 25, 
50, 100, 250, 500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet another 
embodiment, an isolated nucleic acid molecule of the invention hybridizes to the coding 
region. As used herein, the term "hybridizes under stringent conditions" is intended to 

25 describe conditions for hybridization and washing under which nucleotide sequences at 
least about 65% homologous to each other typically remain hybridized to each other. 

Homologs (i.e., nucleic acids encoding NOVX proteins derived from species other 
than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate 
or high stringency hybridization with all or a portion of the particular human sequence as a 

30 probe using methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions 

under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to 
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no other sequences. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures 
than shorter sequences. Generally, stringent conditions are selected to be about 5 °C lower 
than the thermal melting point (Tm) for the specific sequence at a defined ionic strength 
5 and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid 
concentration) at which 50% of the probes complementary to the target sequence 
hybridize to the target sequence at equilibrium. Since the target sequences are generally 
present at excess, at Tm, 50% of the probes are occupied at equilibrium. Typically, 
stringent conditions are those in which the salt concentration is less than about 1 .0 M 
1 0 sodium ion, typically about 0.01 to 1 .0 M sodium ion (or other salts) at pH 7.0 to 8.3 and 
the temperature is at least about 30 °C for short probes, primers or oligonucleotides (e.g., 
10 nt to 50 nt) and at least about 60 °C for longer probes, primers and oligonucleotides. 
Stringent conditions can also be achieved with the addition of destabilizing agents, such as 
formamide. 

1 5 Stringent conditions are known to those skilled in the art and can be found in 

Ausubel, et al., (eds.), CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & 
Sons, N.Y. (1989), 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least 
about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically 
remain hybridized to each other. A non-limiting example of stringent hybridization 

20 conditions are hybridization in a high salt buffer comprising 6X SSC, SO mM Tris-HCl 
(pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured 
salmon sperm DNA at 65°C, followed by one or more washes in 0.2X SSC, 0.01% BSA at 
50°C. An isolated nucleic acid molecule of the invention that hybridizes under stringent 
conditions to a sequence of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, 

25 corresponds to a naturally-occurring nucleic acid molecule. As used herein, a 

"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having a 
nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic 
acid molecule comprising the nucleotide sequence of SEQ ID NO:2/i-l , wherein n is an 

30 integer between 1 and 12, or fragments, analogs or derivatives thereof, under conditions of 

moderate stringency is provided. A non-limiting example of moderate stringency 

hybridization conditions are hybridization in 6X SSC, 5X Reinhardfs solution, 0.5% SDS 
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and 100 mg/ml denatured salmon sperm DNA at 55 °C, followed by one or more washes 
in IX SSC, 0.1% SDS at 37 °C. Other conditions of moderate stringency that can be used 
are well-known within the art. See, e.g., Ausubel, et al. (eds.), 1993, CURRENT PROTOCOLS 
in Molecular Biology, John Wiley & Sons, NY, and Krieger, 1990; Gene Transfer 

5 and Expression, a Laboratory Manual, Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid 
molecule comprising the nucleotide sequences of SEQ ID NO:2n-l, wherein n is an 
integer between 1 and 12, or fragments, analogs or derivatives thereof, under conditions of 
low stringency, is provided. A non-limiting example of low stringency hybridization 

10 conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm 
DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 
25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low 
stringency that can be used are well known in the art {e.g., as employed for cross-species 

1 5 hybridizations). See, e.g., Ausubel, et al. (eds.), 1993, CURRENT PROTOCOLS IN 

Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY; Shilo and Weinberg, 1981 . 
Proc Natl Acad Sci USA 78: 6789-6792. 

Conservative Mutations 

20 In addition to naturally-occurring allelic variants of NOVX sequences that may 

exist in the population, the skilled artisan will further appreciate that changes can be 
introduced by mutation into the nucleotide sequences of SEQ ID NO:2/M, wherein n is an 
integer between 1 and 12, thereby leading to changes in the amino acid sequences of the 
encoded NOVX protein, without altering the functional ability of that NOVX protein. For 

25 example, nucleotide substitutions leading to amino acid substitutions at "non-essential" 
amino acid residues can be made in the sequence of SEQ ID NO:2n, wherein n is an 
integer between 1 and 12. A "non-essential" amino acid residue is a residue that can be 
altered from the wild-type sequences of the NOVX proteins without altering their 
biological activity, whereas an "essential" amino acid residue is required for such 

30 biological activity. For example, amino acid residues that are conserved among the 
NOVX proteins of the invention are predicted to be particularly non-amenable to 
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alteration. Amino acids for which conservative substitutions can be made are well-known 
within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding 
NOVX proteins that contain changes in amino acid residues that are not essential for 

5 activity. Such NOVX proteins differ in amino acid sequence from SEQ ID NO:2/i-l, 
wherein n is an integer between 1 and 12, yet retain biological activity. In one 
embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding 
a protein, wherein the protein comprises an amino acid sequence at least about 40% 
homologous to the amino acid sequences of SEQ ID NO:2/i, wherein n is an integer 

10 between 1 and 12. Preferably, the protein encoded by the nucleic acid molecule is at least 
about 60% homologous to SEQ ID NO:2n, wherein n is an integer between 1 and 12; more 
preferably at least about 70% homologous to SEQ ID NO:2n, wherein n is an integer 
between 1 and 12; still more preferably at least about 80% homologous to SEQ ID NO:2/i, 
wherein n is an integer between 1 and 12; even more preferably at least about 90% 

15 homologous to SEQ ID NO:2/i, wherein n is an integer between 1 and 12; and most 
preferably at least about 95% homologous to SEQ ID NO:2*, wherein n is an integer 
between 1 and 12. 

An isolated nucleic acid molecule encoding a NOVX protein homologous to the 
protein of SEQ ID NO:2«, wherein n is an integer between 1 and 12, can be created by 
20 introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 
sequence of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded 
protein. 

Mutations can be introduced any one of SEQ ID NO:2/?-l, wherein n is an integer 

25 between 1 and 12, by standard techniques, such as site-directed mutagenesis and 

PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at 

one or more predicted, non-essential amino acid residues. A "conservative amino acid 

substitution" is one in which the amino acid residue is replaced with an amino acid residue 

having a similar side chain. Families of amino acid residues having similar side chains 

30 have been defined within the art. These families include amino acids with basic side 

chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic 

acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, 
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tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, 
isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). 
Thus, a predicted non-essential amino acid residue in the NOVX protein is replaced with 

5 another amino acid residue from the same side chain family. Alternatively, in another 
embodiment, mutations can be introduced randomly along all or part of a NOVX coding 
sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for 
NOVX biological activity to identify mutants that retain activity. Following mutagenesis 
of a nucleic acid of SEQ ID NO:2«-l, wherein n is an integer between 1 and 12, the 

10 encoded protein can be expressed by any recombinant technology known in the art and the 
activity of the protein can be determined. 

The relatedness of amino acid families can also be determined based on side chain 
interactions. Substituted amino acids can be fully conserved "strong" residues or fully 
conserved "weak" residues. The "strong" group of conserved amino acid residues can be 

1 5 any one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, 
HY, FYW, wherein the single letter amino acid codes are grouped by those amino acids 
that can be substituted for each other. Likewise, the "weak" group of conserved residues 
can be any one of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, 
NDEQHK, NEQHRK, HFY, wherein the letters within each group represent the single 

20 letter amino acid code. 

In one embodiment, a mutant NOVX protein can be assayed for (i) the ability to 
form protein:protein interactions with other NOVX proteins, other cell-surface proteins, or 
biologically-active portions thereof, (if) complex formation between a mutant NOVX 
protein and a NOVX ligand; or (Hi) the ability of a mutant NOVX protein to bind to an 

25 intracellular target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant NOVX protein can be assayed for the ability 
to regulate a specific biological function (e.g., regulation of insulin release). 

Interfering RNA 

In one aspect of the invention, NOVX gene expression can be attenuated by RNA 
30 interference. One approach well-known in the art is short interfering RNA (siRNA) 
mediated gene silencing where expression products of a NOVX gene are targeted by 
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specific double stranded NOVX derived siRNA nucleotide sequences that are 
complementary to at least a 19-25 nt long segment of the NOVX gene transcript, including 
the 5' untranslated (UT) region, the ORF, or the 3* UT region. See, e.g., PCT applications 
WO00/44895, W099/32619, WO01/75164, WO01/92513, WO 01/29058, WO01/89304, 

5 WO02/16620, and WO02/29858, each incorporated by reference herein in their entirety. 
Targeted genes can be a NOVX gene, or an upstream or downstream modulator of the 
NOVX gene. Nonlimiting examples of upstream or downstream modulators of a NOVX 
gene include, e.g., a transcription factor that binds the NOVX gene promoter, a kinase or 
phosphatase that interacts with a NOVX polypeptide, and polypeptides involved in a 

1 0 NOVX regulatory pathway. 

According to the methods of the present invention, NOVX gene expression is 
silenced using short interfering RNA. A NOVX polynucleotide according to the invention 
includes a siRNA polynucleotide. Such a NOVX siRNA can be obtained using a NOVX 
polynucleotide sequence, for example, by processing the NOVX ribopolynucleotide 

15 sequence in a cell-free system, such as but not limited to a Drosophila extract, or by 
transcription of recombinant double stranded NOVX RNA or by chemical synthesis of 
nucleotide sequences homologous to a NOVX sequence. See, e.g., Tuschl, Zamore, 
Lehmann, Bartel and Sharp (1999), Genes & Dev. 13: 3191-3197, incorporated herein by 
reference in its entirety. When synthesized, a typical 0.2 micromolar-scale RNA synthesis 

20 provides about 1 milligram of siRNA, which is sufficient for 1000 transfection 
experiments using a 12-well tissue culture plate format 

The most efficient silencing is generally observed with siRNA duplexes composed 
of a 21-nt sense strand and a 21-nt antisense strand, paired in a manner to have a 2-nt 
V overhang. The sequence of the 2-nt 3' overhang makes an additional small contribution 

25 to the specificity of siRNA target recognition. The contribution to specificity is localized 
to the unpaired nucleotide adjacent to the first paired bases. In one embodiment, the 
nucleotides in the 3* overhang are ribonucleotides. In an alternative embodiment, the 
nucleotides in the 3' overhang are deoxyribonucleotides. Using 2 , -deoxyribonucleotides 
in the 3' overhangs is as efficient as using ribonucleotides, but deoxyribonucleotides are 

30 often cheaper to synthesize and are most likely more nuclease resistant. 

A contemplated recombinant expression vector of the invention comprises a 

NOVX DNA molecule cloned into an expression vector comprising operatively-linked 
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regulatory sequences flanking the NOVX sequence in a manner that allows for expression 
(by transcription of the DNA molecule) of both strands. An RNA molecule that is 
antisense to NOVX mRNA is transcribed by a first promoter (e.g., a promoter sequence 3* 
of the cloned DNA) and an RNA molecule that is the sense strand for the NOVX mRNA 

5 is transcribed by a second promoter (e.g., a promoter sequence 5' of the cloned DNA). 
The sense and antisense strands may hybridize in vivo to generate siRNA constructs for 
silencing of the NOVX gene. Alternatively, two constructs can be utilized to create the 
sense and anti-sense strands of a siRNA construct. Finally, cloned DNA can encode a 
construct having secondary structure, wherein a single transcript has both the sense and 

10 complementary antisense sequences from the target gene or genes. In an example of this 
embodiment, a hairpin RNAi product is homologous to all or a portion of the target gene. 
In another example, a hairpin RNAi product is a siRNA. The regulatory sequences 
flanking the NOVX sequence may be identical or may be different, such that their 
expression may be modulated independently, or in a temporal or spatial manner. 

15 In a specific embodiment, siRNAs are transcribed intracellularly by cloning the 

NOVX gene templates into a vector containing, eg., a RNA pol m transcription unit from 
the smaller nuclear RNA (snRNA) U6 or the human RNase P RNA HI . One example of a 
vector system is the GeneSuppressor™ RNA Interference kit (commercially available 
from Imgenex). The U6 and HI promoters are members of the type m class of Pol III 

20 promoters. The +1 nucleotide of the U6-like promoters is always guanosine, whereas the 
+1 for HI promoters is adenosine. The termination signal for these promoters is defined by 
five consecutive thymidines. The transcript is typically cleaved after the second uridine. 
Cleavage at this position generates a 3' UU overhang in the expressed siRNA, which is 
similar to the 3' overhangs of synthetic siRNAs. Any sequence less than 400 nucleotides in 

25 length can be transcribed by these promoter, therefore they are ideally suited for the 

expression of around 21 -nucleotide siRNAs in, e.g., an approximately 50-nucleotide RNA 
stem-loop transcript. 

A siRNA vector appears to have an advantage over synthetic siRNAs where long 
term knock-down of expression is desired. Cells transfected with a siRNA expression 

30 vector would experience steady, long-term mRNA inhibition. In contrast, cells transfected 
with exogenous synthetic siRNAs typically recover from mRNA suppression within seven 
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days or ten rounds of cell division. The long-term gene silencing ability of siRNA 
expression vectors may provide for applications in gene therapy. 

In general, siRNAs are chopped from longer dsRNA by an ATP-dependent 
ribonuclease called DICER. DICER is a member of the RNase TH family of double- 

5 stranded RNA-specific endonucleases. The siRNAs assemble with cellular proteins into an 
endonuclease complex. In vitro studies in Drosophila suggest that the siRNAs/protein 
complex (siRNP) is then transferred to a second enzyme complex, called an RNA-induced 
silencing complex (RISC), which contains an endoribonuclease that is distinct from 
DICER. RISC uses the sequence encoded by the antisense siRNA strand to find and 

10 destroy mRNAs of complementary sequence. The siRNA thus acts as a guide, restricting 
the ribonuclease to cleave only mRNAs complementary to one of the two siRNA strands. 

A NOVX mRNA region to be targeted by siRNA is generally selected from a 
desired NOVX sequence beginning 50 to 100 nt downstream of the start codon. 
Alternatively, 5' or 3' UTRs and regions nearby the start codon can be used but are 

1 5 generally avoided, as these may be richer in regulatory protein binding sites. UTR-binding 
proteins and/or translation initiation complexes may interfere with binding of the siRNP or 
RISC endonuclease complex. An initial BLAST homology search for the selected siRNA 
sequence is done against an available nucleotide sequence library to ensure that only one 
gene is targeted. Specificity of target recognition by siRNA duplexes indicate that a single 

20 point mutation located in the paired region of an siRNA duplex is sufficient to abolish 
target mRNA degradation. See, Elbashir et al 2001 EMBO J. 20(23):6877-88. Hence, 
consideration should be taken to accommodate SNPs, polymorphisms, allelic variants or 
species-specific variations when targeting a desired gene. 

In one embodiment, a complete NOVX siRNA experiment includes the proper 

25 negative control. A negative control siRNA generally has the same nucleotide 

composition as the NOVX siRNA but lack significant sequence homology to the genome. 
Typically, one would scramble the nucleotide sequence of the NOVX siRNA and do a 
homology search to make sure it lacks homology to any other gene. 

Two independent NOVX siRNA duplexes can be used to knock-down a target 

30 NOVX gene. This helps to control for specificity of the silencing effect In addition, 

expression of two independent genes can be simultaneously knocked down by using equal 
concentrations of different NOVX siRNA duplexes, a NOVX siRNA and an siRNA 
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for a regulator of a NOVX gene or polypeptide. Availability of siRNA-associating 
proteins is believed to be more limiting than target mRNA accessibility. 

A targeted NOVX region is typically a sequence of two adenines (AA) and two 
thymidines (TT) divided by a spacer region of nineteen (N19) residues (e.g., 

5 AA(N19)TT). A desirable spacer region has a G/C-content of approximately 30% to 70%, 
and more preferably of about 50%. If the sequence AA(N19)TT is not present in the 
target sequence, an alternative target region would be AA(N21). The sequence of the 
NOVX sense siRNA corresponds to (N19)TT orN21, respectively. In the latter case, 
conversion of the 3* end of the sense siRNA to TT can be performed if such a sequence 

10 does not naturally occur in the NOVX polynucleotide. The rationale for this sequence 
conversion is to generate a symmetric duplex with respect to the sequence composition of 
the sense and antisense 3* overhangs. Symmetric 3* overhangs may help to ensure that the 
siRNPs are formed with approximately equal ratios of sense and antisense target RNA- 
cleaving siRNPs. See, e.g., Elbashir, Lendeckel and Tuschl (2001). Genes & Dev. 15: 

1 5 1 88-200, incorporated by reference herein in its entirely. The modification of the 

overhang of the sense sequence of the siRNA duplex is not expected to affect targeted 
mRNA recognition, as the antisense siRNA strand guides target recognition. 

Alternatively, if the NOVX target mRNA does not contain a suitable AA(N21) 
sequence, one can search for the sequence NA(N2 1). Further, the sequence of the sense 

20 strand and antisense strand may still be synthesized as 5' (N19)TT, as it is believed that the 
sequence of the 3 -most nucleotide of the antisense siRNA does not contribute to 
specificity. Unlike antisense or ribozyme technology, the secondary structure of the target 
mRNA does not appear to have a strong effect on silencing. See 9 Harborth, et al. (2001) J. 
Cell Science 1 14: 4557-4565, incorporated by reference in its entirety. 

25 Transfection of NOVX siRNA duplexes can be achieved using standard nucleic 

acid transfection methods, for example, OLIGOFECT AMINE Reagent (commercially 
available from Invitrogen). An assay for NOVX gene silencing is generally performed 
approximately 2 days after transfection. No NOVX gene silencing has been observed in 
the absence of transfection reagent, allowing for a comparative analysis of the wild-type 

30 and silenced NOVX phenotypes. In a specific embodiment, for one well of a 24-well 

plate, approximately 0.84 jig of the siRNA duplex is generally sufficient. Cells are 

typically seeded the previous day, and are transfected at about 50% confluence. The 
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choice of cell culture media and conditions are routine to those of skill in the art, and will 
vary with the choice of cell type. The efficiency of transfection may depend on the cell 
type, but also on the passage number and the confluency of the cells. The time and the 
manner of formation of siRNA-liposome complexes (e.g. inversion versus vortexing) are 

5 also critical. Low transfection efficiencies are the most frequent cause of unsuccessful 
NOVX silencing. The efficiency of transfection needs to be carefully examined for each 
new cell line to be used. Preferred cell are derived from a mammal, more preferably from 
a rodent such as a rat or mouse, and most preferably from a human. Where used for 
therapeutic treatment, the cells are preferentially autologous, although non-autologous cell 

10 sources are also contemplated as within the scope of the present invention. 

For a control experiment, transfection of 0.84 jig single-stranded sense NOVX 
siRNA will have no effect on NOVX silencing, and 0.84 \ig antisense siRNA has a weak 
silencing effect when compared to 0.84 \ig of duplex siRNAs. Control experiments again 
allow for a comparative analysis of the wild-type and silenced NOVX phenotypes. To 

1 5 control for transfection efficiency, targeting of common proteins is typically performed, 
for example targeting of lamin A/C or transfection of a CMV-driven EGFP-expression 
plasmid (e.g. commercially available from Clontech). In the above example, a 
determination of the fraction of lamin A/C knockdown in cells is determined the next day 
by such techniques as immunofluorescence, Western blot, Northern blot or other similar 

20 assays for protein expression or gene expression. Lamin A/C monoclonal antibodies can 
be obtained from Santa Cruz Biotechnology. 

Depending on the abundance and the half life (or turnover) of the targeted NOVX 
polynucleotide in a cell, a knock-down phenotype may become apparent after 1 to 3 days, 
or even later. In cases where no NOVX knock-down phenotype is observed, depletion of 

25 the NOVX polynucleotide may be observed by immunofluorescence or Western blotting. 
If the NOVX polynucleotide is still abundant after 3 days, cells need to be split and 
transferred to a fresh 24-well plate for re-transfection. If no knock-down of the targeted 
protein is observed, it may be desirable to analyze whether the target mRNA (NOVX or a 
NOVX upstream or downstream gene) was effectively destroyed by the transfected siRNA 

30 duplex. Two days after transfection, total RNA is prepared, reverse transcribed using a 

target-specific primer, and PCR-amplified with a primer pair covering at least one exon- 

exon junction in order to control for amplification of pre-mRNAs. RT/PCR of a non- 
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targeted mRNA is also needed as control. Effective depletion of the mRNA yet 
undetectable reduction of target protein can indicate that a large reservoir of stable NOVX 
protein may exist in the cell. Multiple transfection in sufficiently long intervals may be 
necessary until the target protein is finally depleted to a point where a phenotype may 

5 become apparent. If multiple transfection steps are required, cells are split 2 to 3 days 
after transfection. The cells may be transfected immediately after splitting. 

An inventive therapeutic method of the invention contemplates administering a 
NOVX siRNA construct as therapy to compensate for increased or aberrant NOVX 
expression or activity. The NOVX ribopolynucleotide is obtained and processed into 

1 0 siRNA fragments, or a NOVX siRNA is synthesized, as described above. The NOVX 

siRNA is administered to cells or tissues using known nucleic acid transfection techniques, 
as described above. A NOVX siRNA specific for a NOVX gene will decrease or 
knockdown NOVX transcription products, which leads to reduced NOVX polypeptide 
production, resulting in reduced NOVX polypeptide activity in the cells or tissues. 

1 5 The present invention also encompasses a method of treating a disease or condition 

associated with the presence of a NOVX protein in an individual comprising administering 
to the individual an RNAi construct that targets the mRNA of the protein (the mRNA that 
encodes the protein) for degradation. A specific RNAi construct includes a siRNA or a 
double stranded gene transcript that is processed into siRNAs. Upon treatment, the target 

20 protein is not produced or is not produced to the extent it would be in the absence of the 
treatment. 

Where the NOVX gene function is not correlated with a known phenotype, a 
control sample of cells or tissues from healthy individuals provides a reference standard 
for determining NOVX expression levels. Expression levels are detected using the assays 

25 described, e.g., RT-PCR, Northern blotting, Western blotting, ELISA, and the like. A 
subject sample of cells or tissues is taken from a mammal, preferably a human subject, 
suffering from a disease state. The NOVX ribopolynucleotide is used to produce siRNA 
constructs, that are specific for the NOVX gene product These cells or tissues are treated 
by administering NOVX siRNA's to the cells or tissues by methods described for the 

30 transfection of nucleic acids into a cell or tissue, and a change in NOVX polypeptide or 

polynucleotide expression is observed in the subject sample relative to the control sample, 

using the assays described. This NOVX gene knockdown approach provides a rapid 
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method for determination of a NOVX minus (NOVX") phenotype in the treated subject 
sample. The NOVX" phenotype observed in the treated subject sample thus serves as a 
marker for monitoring the course of a disease state during treatment. 

In specific embodiments, a NOVX siRNA is used in therapy. Methods for the 
5 generation and use of a NOVX siRNA are known to those skilled in the art. Example 
techniques are provided below. 

Production of RNAs 

Sense RNA (ssRNA) and antisense RNA (asRNA) of NOVX are produced using 
known methods such as transcription in RNA expression vectors. In the initial 

10 experiments, the sense and antisense RNA are about 500 bases in length each. The 

produced ssRNA and asRNA (0.5 nM) in 10 mM Tris-HCl (pH 7.5) with 20 mM NaCl 
were heated to 95° C for 1 min then cooled and annealed at room temperature for 12 to 16 
h. The RNAs are precipitated and resuspended in lysis buffer (below). To monitor 
annealing, RNAs are electrophoresed in a 2% agarose gel in TBE buffer and stained with 

1 5 ethidium bromide. See, e.g., Sambrook et al., Molecular Cloning. Cold Spring Harbor 
Laboratory Press, Plainview, N.Y. (1989). 

Lysate Preparation 

Untreated rabbit reticulocyte lysate (Ambion) are assembled according to the 
manufacturer's directions. dsRNA is incubated in the lysate at 30° C for 10 min prior to the 

20 addition of mRNAs. Then NOVX mRNAs are added and the incubation continued for an 
additional 60 min. The molar ratio of double stranded RNA and mRNA is about 200: 1 . 
The NOVX mRNA is radiolabeled (using known techniques) and its stability is monitored 
by gel electrophoresis. 

In a parallel experiment made with the same conditions, the double stranded RNA 

25 is internally radiolabeled with a 32 P- ATP. Reactions are stopped by the addition of 2 X 
proteinase K buffer and deproteinized as described previously (Tuschl et al. 9 Genes Dev., 
13:3191-3197 (1999)). Products are analyzed by electrophoresis in 15% or 18% 
polyacrylamide sequencing gels using appropriate RNA standards. By monitoring the gels 
for radioactivity, the natural production of 10 to 25 nt RNAs from the double stranded 

30 RNA can be determined. 
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The band of double stranded RNA, about 21-23 bps, is eluded. The efficacy of 
these 21-23 mers for suppressing NOVX transcription is assayed in vitro using the same 
rabbit reticulocyte assay described above using 50 nanomolar of double stranded 21-23 
mer for each assay. The sequence of these 21-23 mers is then determined using standard 
5 nucleic acid sequencing techniques. 

RNA Preparation 

21 nt RNAs, based on the sequence determined above, are chemically synthesized 
using Expedite RNA phosphoramidites and thymidine phosphoramidite (Proligo, 
Germany). Synthetic oligonucleotides are deprotected and gel-purified (Elbashir, 
10 Lendeckel, & Tuschl, Genes & Dev. 15, 188-200 (2001)), followed by Sep-Pak C18 
cartridge (Waters, Milford, Mass., USA) purification (Tuschl, et al., Biochemistry, 
32:11658-11668 (1993)). 

These RNAs (20 pM) single strands are incubated in annealing buffer (100 mM 
potassium acetate, 30 mM HEPES-KOH at pH 7.4, 2 mM magnesium acetate) for 1 min at 
15 90° C followed by 1 h at 37° C. 

Cell Culture 

A cell culture known in the art to regularly express NOVX is propagated using 
standard conditions. 24 hours before transfection, at approx. 80% confluency, the cells are 
trypsinized and diluted 1:5 with fresh medium without antibiotics (1-3 X 105 cells/ml) and 

20 transferred to 24-well plates (500 ml/well). Transfection is performed using a 

commercially available lipofection kit and NOVX expression is monitored using standard 
techniques with positive and negative control. A positive control is cells that naturally 
express NOVX while a negative control is cells that do not express NOVX. Base-paired 
21 and 22 nt siRNAs with overhanging 3* ends mediate efficient sequence-specific mRNA 

25 degradation in lysates and in cell culture. Different concentrations of siRNAs are used. 
An efficient concentration for suppression in vitro in mammalian culture is between 25 
nM to 100 nM final concentration. This indicates that siRNAs are effective at 
concentrations that are several orders of magnitude below the concentrations applied in 
conventional antisense or ribozyme gene targeting experiments. 
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The above method provides a way both for the deduction of NOVX siRNA 
sequence and the use of such siRNA for in vitro suppression. In vivo suppression may be 
performed using the same siRNA using well known in vivo transfection or gene therapy 
transfection techniques. 

5 Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid 
molecules that are hybridizable to or complementary to the nucleic acid molecule 
comprising the nucleotide sequence of SEQ ID NO:2/i-l , wherein n is an integer between 
1 and 12, or fragments, analogs or derivatives thereof. An "antisense" nucleic acid 

10 comprises a nucleotide sequence that is complementary to a "sense" nucleic acid encoding 
a protein (e.g., complementary to the coding strand of a double-stranded cDNA molecule 
or complementary to an mRNA sequence). In specific aspects, antisense nucleic acid 
molecules are provided that comprise a sequence complementary to at least about 10, 25, 
50, 100, 250 or 500 nucleotides or an entire NOVX coding strand, or to only a portion 

1 5 thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and analogs 
of a NOVX protein of SEQ ID NO:2/i, wherein n is an integer between 1 and 12, or 
antisense nucleic acids complementary to a NOVX nucleic acid sequence of SEQ ID 
NO:2n-l, wherein n is an integer between 1 and 12, are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 

20 region" of the coding strand of a nucleotide sequence encoding a NOVX protein. The 
term "coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues. In another embodiment, the antisense 
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a 
nucleotide sequence encoding the NOVX protein. The term "noncoding region" refers to 

25 5' and 3' sequences which flank the coding region that are not translated into amino acids 
(i.e., also referred to as 5' and 3 f untranslated regions). 

Given the coding strand sequences encoding the NOVX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson 
and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 

30 complementary to the entire coding region of NOVX mRNA, but more preferably is an 
oligonucleotide that is antisense to only a portion of the coding or noncoding region of 
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NOVX mRNA. For example, the antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of NOVX mRNA. An antisense 
oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 
nucleotides in length. An antisense nucleic acid of the invention can be constructed using 

5 chemical synthesis or enzymatic ligation reactions using procedures known in the art. For 
example, an antisense nucleic acid (eg., an antisense oligonucleotide) can be chemically 
synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and sense nucleic acids (e.g., 

10 phosphorothioate derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-carboxymethylaminomethyl-2-thiouridine, 
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyluracil, dihydrouracil, 

1 5 beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1 -methylguanine, 
1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 
5-methoxyuracil, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 2-thiouracil, 
4-thiouracil, beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil, 

20 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, 
the antisense nucleic acid can be produced biologically using an expression vector into 

25 which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed 
from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of 
interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to 
a subject or generated in situ such that they hybridize with or bind to cellular mRNA 

30 and/or genomic DNA encoding a NOVX protein to thereby inhibit expression of the 

protein (e.g., by inhibiting transcription and/or translation). The hybridization can be by 

conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
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case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An example of a route of 
administration of antisense nucleic acid molecules of the invention includes direct 
injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified 

5 to target selected cells and then administered systemically. For example, for systemic 
administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface (e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 
antigens). The antisense nucleic acid molecules can also be delivered to cells using the 

10 vectors described herein. To achieve sufficient nucleic acid molecules, vector constructs 
in which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol in promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is 
an ct-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 

1 5 specific double-stranded hybrids with complementary RNA in which, contrary to the usual 
P-units, the strands run parallel to each other. See, e.g., Gaultier, et a/., 1987. Nucl Acids 
Res. 15: 6625-6641. The antisense nucleic acid molecule can also comprise a 
y-o-methylribonucleotide (See, eg., Inoue, etal 1987. NucL Acids Res. 15: 6131-6148) 
or a chimeric RNA-DNA analogue (See, e.g., Inoue, et a/., 1987. FEBS Lett 215: 

20 327-330. 

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified 
bases, and nucleic acids whose sugar phosphate backbones are modified or derivatized. 
These modifications are carried out at least in part to enhance the chemical stability of the 

25 modified nucleic acid, such that they may be used, for example, as antisense binding 
nucleic acids in therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 

30 complementary region. Thus, ribozymes (eg., hammerhead ribozymes as described in 
Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave 
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NOVX mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme 
having specificity for a NOVX-encoding nucleic acid can be designed based upon the 
nucleotide sequence of a NOVX cDNA disclosed herein (i.e., SEQ ID NO:2n-l, wherein n 
is an integer between 1 and 12). For example, a derivative of a Tetrahymena L-19 IVS 

5 RNA can be constructed in which the nucleotide sequence of the active site is 

complementary to the nucleotide sequence to be cleaved in a NOVX-encoding mRNA. 
See, e.g., U.S. Patent 4,987,071 to Cech, et al and U.S. Patent 5,1 16,742 to Cech, et al 
NOVX mRNA can also be used to select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel et al, (1993) Science 

10 261:1411-1418. 

Alternatively, NOVX gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the 
NOVX promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the NOVX gene in target cells. See, e.g., Helene, 1991. Anticancer Drug 

1 5 Des. 6: 569-84; Helene, et al 1 992. Ann. N. Y. Acad. Sci. 660: 27-36; Maher, 1 992. 
Bioassays 14: 807-15. 

In various embodiments, the NOVX nucleic acids can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, 
or solubility of the molecule. For example, the deoxyribose phosphate backbone of the 

20 nucleic acids can be modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 
1996. Bioorg Med Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or 
"PNAs" refer to nucleic acid mimics (e.g., DNA mimics) in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleotide bases are retained. The neutral backbone of PNAs has been shown to allow for 

25 specific hybridization to DNA and RNA under conditions of low ionic strength. The 

synthesis of PNA oligomer can be performed using standard solid phase peptide synthesis 
protocols as described in Hyrup, et al, 1996. supra; Perry-O'Keefe, et al, 1996. Proc. 
Natl Acad. Sci. USA 93: 14670-14675. 

PNAs of NOVX can be used in therapeutic and diagnostic applications. For 

30 example, PNAs can be used as antisense or antigene agents for sequence-specific 

modulation of gene expression by, e.g., inducing transcription or translation arrest or 

inhibiting replication. PNAs of NOVX can also be used, for example, in die analysis of 
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single base pair mutations in a gene (e.g, PNA directed PCR clamping; as artificial 
restriction enzymes when used in combination with other enzymes, e.g., Si nucleases (See, 
Hynip, et al, I996supra); or as probes or primers for DNA sequence and hybridization 
(See, Hyrup, et al., 1996, supra; Peny-O'Keefe, et al, 1996. supra). 

5 In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art For example, PNA-DNA chimeras of NOVX can be generated 
that may combine the advantageous properties of PNA and DNA. Such chimeras allow 

10 DNA recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the 
DNA portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of 
base stacking, number of bonds between the nucleotide bases, and orientation (see, Hyrup, 
et al., 1996. supra). The synthesis of PNA-DNA chimeras can be performed as described 

15 inHyrup, etal, 1996.swpraandFinn,e/a/., 1996. NuclAcidsRes 24: 3357-3363. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 
S'^-methoxytrityOamino-S'-deoxy-thymidine phosphoramidite, can be used between the 
PNA and the 5 1 end of DNA. See, e.g., Mag, et al, 1989. Nucl Acid Res 17: 5973-5988. 

20 PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule 
with a 5 f PNA segment and a 3 f DNA segment. See, e.g., Finn, et al., 1996. supra. 
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3* 
PNA segment. See, e.g., Petersen, et al., 1975. Bioorg. Med. Chem. Lett 5: 1 1 19-1 1 124. 
In other embodiments, the oligonucleotide may include other appended groups 

25 such as peptides (e.g. , for targeting host cell receptors in vivo), or agents facilitating 

transport across the cell membrane (see, e.g., Letsinger, et ai, 1989. Proc. Natl. Acad. ScL 
U.S.A. 86: 6553-6556; Lemaitre, et al., 1987. Proc. Natl. Acad. Sci. 84: 648-652; PCT 
Publication No. WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. 
WO 89/10134). In addition, oligonucleotides can be modified with hybridization triggered 

30 cleavage agents (see, e.g., Krol, et ai, 1988. BioTechniques 6:958-976) or intercalating 

agents (see, e.g., Zon, 1988. Pharm. Res. 5: 539-549). To this end, the oligonucleotide 

may be conjugated to another molecule, e.g., a peptide, a hybridization triggered 
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cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, and the 
like. 

NOVX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the 

5 amino acid sequence of NOVX polypeptides whose sequences are provided in any one of 
SEQ ID NO:2/i, wherein n is an integer between 1 and 12. The invention also includes a 
mutant or variant protein any of whose residues may be changed from the corresponding 
residues shown in any one of SEQ ID NO:2n, wherein n is an integer between 1 and 12, 
while still encoding a protein that maintains its NOVX activities and physiological 

10 functions, or a functional fragment thereof. 

In general, a NOVX variant that preserves NOVX-like function includes any 
variant in which residues at a particular position in the sequence have been substituted by 
other amino acids, and further include the possibility of inserting an additional residue or 
residues between two residues of the parent protein as well as the possibility of deleting 

15 one or more residues from the parent sequence. Any amino acid substitution, insertion, or 
deletion is encompassed by the invention. In favorable circumstances, the substitution is a 
conservative substitution as defined above. 

One aspect of the invention pertains to isolated NOVX proteins, and 
biologically-active portions thereof, or derivatives, fragments, analogs or homologs 

20 thereof. Also provided are polypeptide fragments suitable for use as immunogens to raise 
anti-NOVX antibodies. In one embodiment, native NOVX proteins can be isolated from 
cells or tissue sources by an appropriate purification scheme using standard protein 
purification techniques. In another embodiment, NOVX proteins are produced by 
recombinant DNA techniques. Alternative to recombinant expression, a NOVX protein or 

25 polypeptide can be synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" polypeptide or protein or biologically-active portion 
thereof is substantially free of cellular material or other contaminating proteins from the 
cell or tissue source from which die NOVX protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically synthesized. The language 

30 "substantially free of cellular material" includes preparations of NOVX proteins in which 
the protein is separated from cellular components of the cells from which it is isolated or 
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recombinantly-produced. In one embodiment, the language "substantially free of cellular 
material" includes preparations of NOVX proteins having less than about 30% (by dry 
weight) of non-NOVX proteins (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-NOVX proteins, still more preferably less 

5 than about 10% of non-NOVX proteins, and most preferably less than about 5% of 
non-NOVX proteins. When the NOVX protein or biologically-active portion thereof is 
recombinantly-produced, it is also preferably substantially free of culture medium, i.e., 
culture medium represents less than about 20%, more preferably less than about 10%, and 
most preferably less than about 5% of the volume of the NOVX protein preparation. 

10 The language "substantially free of chemical precursors or other chemicals" 

includes preparations of NOVX proteins in which the protein is separated from chemical 
precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other chemicals" 
includes preparations of NOVX proteins having less than about 30% (by dry weight) of 

1 5 chemical precursors or non-NOVX chemicals, more preferably less than about 20% 

chemical precursors or non-NOVX chemicals, still more preferably less than about 10% 
chemical precursors or non-NOVX chemicals, and most preferably less than about 5% 
chemical precursors or non-NOVX chemicals. 

Biologically-active portions of NOVX proteins include peptides comprising amino 

20 acid sequences sufficiently homologous to or derived from the amino acid sequences of 
the NOVX proteins (e.g., the amino acid sequence of SEQ ID NO:2«, wherein n is an 
integer between 1 and 12) that include fewer amino acids than the full-length NOVX 
proteins, and exhibit at least one activity of a NOVX protein. Typically, 
biologically-active portions comprise a domain or motif with at least one activity of the 

25 NOVX protein. A biologically-active portion of a NOVX protein can be a polypeptide 
which is, for example, 10, 25, 50, 100 or more amino acid residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein 
are deleted, can be prepared by recombinant techniques and evaluated for one or more of 
the functional activities of a native NOVX protein. 

30 In an embodiment, the NOVX protein has an amino acid sequence of SEQ ID 

NO:2/i, wherein n is an integer between 1 and 12. In other embodiments, the NOVX 

protein is substantially homologous to SEQ ID NO:2n, wherein n is an integer between 1 
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and 12, and retains the functional activity of the protein of SEQ ED NO:2w, wherein n is an 
integer between 1 and 12, yet differs in amino acid sequence due to natural allelic 
variation or mutagenesis, as described in detail, below. Accordingly, in another 
embodiment, the NOVX protein is a protein that comprises an amino acid sequence at 
5 least about 45% homologous to the amino acid sequence of SEQ ID NO:2«, wherein n is 
an integer between 1 and 12, and retains the functional activity of the NOVX proteins of 
SEQ ID NO:2/i, wherein n is an integer between 1 and 12. 

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic 

10 acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 

1 5 residue or nucleotide as the corresponding position in the second sequence, then the 

molecules are homologous at that position as used herein amino acid or nucleic acid 
"homology" is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity 
between two sequences. The homology may be determined using computer programs 

20 known in the art, such as GAP software provided in the GCG program package. See, 

Needleman and Wunsch, 1970. JMolBiol 48: 443-453. Using GCG GAP software with 
the following settings for nucleic acid sequence comparison: GAP creation penalty of 5.0 
and GAP extension penalty of 0.3, the coding region of the analogous nucleic acid 
sequences referred to above exhibits a degree of identity preferably of at least 70%, 75%, 

25 80%, 85%, 90%, 95%, 98%, or 99%, with the CDS (encoding) part of the DNA sequence 
of SEQ ID NO:2/i-l, wherein n is an integer between 1 and 12. 

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region 
of comparison. The term "percentage of sequence identity" is calculated by comparing 

30 two optimally aligned sequences over that region of comparison, determining the number 
of positions at which the identical nucleic acid base {e.g., A, T, C, G, U, or I, in the case of 
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nucleic acids) occurs in both sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of positions in the region of 
comparison {i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes a 
5 characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 
sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

Chimeric and Fusion Proteins 

10 The invention also provides NOVX chimeric or fusion proteins. As used herein, a 

NOVX "chimeric protein" or "fusion protein" comprises a NOVX polypeptide 
operatively-linked to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a 
polypeptide having an amino acid sequence corresponding to a NOVX protein of SEQ ID 
NO:2w, wherein n is an integer between 1 and 12, whereas a "non-NOVX polypeptide" 

1 5 refers to a polypeptide having an amino acid sequence corresponding to a protein that is 
not substantially homologous to the NOVX protein, e.g., a protein that is different from 
the NOVX protein and that is derived from the same or a different organism. Within a 
NOVX fusion protein the NOVX polypeptide can correspond to all or a portion of a 
NOVX protein. In one embodiment, a NOVX fusion protein comprises at least one 

20 biologically-active portion of a NOVX protein. In another embodiment, a NOVX fusion 
protein comprises at least two biologically-active portions of a NOVX protein. In yet 
another embodiment, a NOVX fusion protein comprises at least three biologically-active 
portions of a NOVX protein. Within the fusion protein, the term "operatively-linked" is 
intended to indicate that the NOVX polypeptide and the non-NOVX polypeptide are fused 

25 in-frame with one another. The non-NOVX polypeptide can be fused to the N-terminus or 
C-terminus of the NOVX polypeptide. 

In one embodiment, the fusion protein is a GST-NO VX fusion protein in which the 
NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase) 
sequences. Such fusion proteins can facilitate the purification of recombinant NOVX 

30 polypeptides. 
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In another embodiment, the fusion protein is a NOVX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host 
cells), expression and/or secretion of NOVX can be increased through use of a 
heterologous signal sequence. 

5 In yet another embodiment, the fusion protein is a NOVX-immunoglobulin fusion 

protein in which the NOVX sequences are fused to sequences derived from a member of 
the immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the 
invention can be incorporated into pharmaceutical compositions and administered to a 
subject to inhibit an interaction between a NOVX ligand and a NOVX protein on the 

10 surface of a cell, to thereby suppress NOVX-mediated signal transduction in vivo. The 
NOVX-immunoglobulin fusion proteins can be used to affect the bioavailability of a 
NOVX cognate ligand. Inhibition of the NOVX ligand/NOVX interaction may be useful 
therapeutically for both the treatment of proliferative and differentiative disorders, as well 
as modulating (e.g. promoting or inhibiting) cell survival. Moreover, the 

1 5 NOVX-immunoglobulin fusion proteins of the invention can be used as immunogens to 
produce anti-NOVX antibodies in a subject, to purify NOVX ligands, and in screening 
assays to identify molecules that inhibit the interaction of NOVX with a NOVX ligand. 

A NOVX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 

20 polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, by employing blunt-ended or stagger-ended termini for ligation, 
restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends 
as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 

25 techniques including automated DNA synthesizers. Alternatively, PCR amplification of 
gene fragments can be carried out using anchor primers that give rise to complementary 
overhangs between two consecutive gene fragments that can subsequently be annealed and 
reamplified to generate a chimeric gene sequence (see, e.g., Ausubel, et al. (eds.) 
Current Protocols in Molecular Biology, John Wiley & Sons, 1 992). Moreover, 

30 many expression vectors are commercially available that already encode a fusion moiety 

(eg., a GST polypeptide). A NOVX-encoding nucleic acid can be cloned into such an 

expression vector such that the fusion moiety is linked in-frame to the NOVX protein. 
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NOVX Agonists and Antagonists 

The invention also pertains to variants of the NOVX proteins that function as either 
NOVX agonists (i.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein 
can be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX 

5 protein). An agonist of the NOVX protein can retain substantially the same, or a subset of, 
the biological activities of the naturally occurring form of the NOVX protein. An 
antagonist of the NOVX protein can inhibit one or more of the activities of the naturally 
occurring form of the NOVX protein by, for example, competitively binding to a 
downstream or upstream member of a cellular signaling cascade which includes the 

10 NOVX protein. Thus, specific biological effects can be elicited by treatment with a 
variant of limited function. In one embodiment, treatment of a subject with a variant 
having a subset of the biological activities of the naturally occurring form of the protein 
has fewer side effects in a subject relative to treatment with the naturally occurring form of 
the NOVX proteins. 

15 Variants of the NOVX proteins that function as either NOVX agonists (i.e., 

mimetics) or as NOVX antagonists can be identified by screening combinatorial libraries 
of mutants (e.g. , truncation mutants) of the NOVX proteins for NOVX protein agonist or 
antagonist activity. In one embodiment, a variegated library of NOVX variants is 
generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a 

20 variegated gene library. A variegated library of NOVX variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic oligonucleotides into gene 
sequences such that a degenerate set of potential NOVX sequences is expressible as 
individual polypeptides, or alternatively, as a set of larger fusion proteins (eg., for phage 
display) containing the set of NOVX sequences therein. There are a variety of methods 

25 which can be used to produce libraries of potential NOVX variants from a degenerate 
oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
appropriate expression vector. Use of a degenerate set of genes allows for the provision, 
in one mixture, of all of the sequences encoding the desired set of potential NOVX 

30 sequences. Methods for synthesizing degenerate oligonucleotides are well-known within 
the art See, e.g., Narang, 1983. Tetrahedron 39: 3; Itakura, et a/., 1984. Annu. Rev. 
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Biochem. 53: 323; Itakura, et al. 9 1984. Science 198: 1056; Dee, et al 9 1983. Nucl Acids 
Res.W.All. 

Polypeptide Libraries 

In addition, libraries of fragments of the NOVX protein coding sequences can be 

5 used to generate a variegated population of NOVX fragments for screening and 

subsequent selection of variants of a NOVX protein. In one embodiment, a library of 
coding sequence fragments can be generated by treating a double stranded PCR fragment 
of a NOVX coding sequence with a nuclease under conditions wherein nicking occurs 
only about once per molecule, denaturing the double stranded DNA, renaturing the DNA 

10 to form double-stranded DNA that can include sense/antisense pairs from different nicked 
products, removing single stranded portions from reformed duplexes by treatment with Sj 
nuclease, and ligating the resulting fragment library into an expression vector. By this 
method, expression libraries can be derived which encodes N-terminal and internal 
fragments of various sizes of the NOVX proteins. 

1 5 Various techniques are known in the art for screening gene products of 

combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of 
NOVX proteins. The most widely used techniques, which are amenable to high 

20 throughput analysis, for screening large gene libraries typically include cloning the gene 
library into replicable expression vectors, transforming appropriate cells with the resulting 
library of vectors, and expressing the combinatorial genes under conditions in which 
detection of a desired activity facilitates isolation of the vector encoding the gene whose 
product was detected. Recursive ensemble mutagenesis (REM), a new technique that 

25 enhances the frequency of functional mutants in the libraries, can be used in combination 
with the screening assays to identify NOVX variants. See, e.g., Arkin and Yourvan, 1992. 
Proc. Natl. Acad. Sci. USA 89: 781 1-7815; Delgrave, et al. 9 1993. Protein Engineering 
6:327-331. 
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Anti-NOVX Antibodies 

Included in the invention are antibodies to NOVX proteins, or fragments of NOVX 
proteins. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 

5 contain an antigen binding site that specifically binds (immunoreacts with) an antigen. 
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single 
chain, Fab, Fab 1 and fragments, and an Fab expression library. In general, antibody 
molecules obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, 
which differ from one another by the nature of the heavy chain present in the molecule. 

10 Certain classes have subclasses as well, such as IgGi, IgG2, and others. Furthermore, in 
humans, the light chain may be a kappa chain or a lambda chain. Reference herein to 
antibodies includes a reference to all such classes, subclasses and types of human antibody 
species. 

An isolated protein of the invention intended to serve as an antigen, or a portion or 

1 5 fragment thereof, can be used as an immunogen to generate antibodies that 

immunospecifically bind the antigen, using standard techniques for polyclonal and 
monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 
invention provides antigenic peptide fragments of the antigen for use as immunogens. An 
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid 

20 sequence of the full length protein, such as an amino acid sequence of SEQ ID NO:2/i, 

wherein n is an integer between 1 and 12, and encompasses an epitope thereof such that an 
antibody raised against the peptide forms a specific immune complex with the full length 
protein or with any fragment that contains the epitope. Preferably, the antigenic peptide 
comprises at least 10 amino acid residues, or at least IS amino acid residues, or at least 20 

25 amino acid residues, or at least 30 amino acid residues. Preferred epitopes encompassed 
by the antigenic peptide are regions of the protein that are located on its surface; 
commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of NOVX that is located on the surface of the protein, e.g., a 

30 hydrophilic region. A hydrophobicity analysis of the human NOVX protein sequence 
indicates which regions of a NOVX polypeptide are particularly hydrophilic and, 
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therefore, are likely to encode surface residues useful for targeting antibody production. 
As a means for targeting antibody production, hydropathy plots showing regions of 
hydrophilicity and hydrophobicity may be generated by any method well known in the art, 
including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or 

5 without Fourier transformation. See, e.g., Hopp and Woods, 1981 , Proc. Nat. Acad. Sci. 
USA 78: 3824-3828; Kyte and Doolittle 1982, /. Mol Biol 157: 105-142, each 
incorporated herein by reference in their entirety. Antibodies that are specific for one or 
more domains within an antigenic protein, or derivatives, fragments, analogs or homologs 
thereof, are also provided herein. 

10 The term "epitope" includes any protein determinant capable of specific binding to 

an immunoglobulin or T-cell receptor. Epitopic determinants usually consist of 
chemically active surface groupings of molecules such as amino acids or sugar side chains 
and usually have specific three dimensional structural characteristics, as well as specific 
charge characteristics. A NOVX polypeptide or a fragment thereof comprises at least one 

15 antigenic epitope. An anti-NOVX antibody of the present invention is said to specifically 
bind to antigen NOVX when the equilibrium binding constant (Kd) is £1 ^M, preferably £ 
100 nM, more preferably £ 10 nM, and most preferably £ 100 pM to about 1 pM, as 
measured by assays such as radioligand binding assays or similar assays known to those 
skilled in the art 

20 A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 

thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 

25 derivatives, fragments, analogs homologs or orthologs thereof (see, for example, 

Antibodies: A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, incorporated herein by reference). Some of 
these antibodies are discussed below. 

Polyclonal Antibodies 

30 For the production of polyclonal antibodies, various suitable host animals {e.g. , 

rabbit, goat, mouse or other mammal) may be immunized by one or more injections with 

75 



WO 03/080856 



PCT/US03/08490 



the native protein, a synthetic variant thereof, or a derivative of the foregoing. An 
appropriate immunogenic preparation can contain, for example, the naturally occurring 
immunogenic protein, a chemically synthesized polypeptide representing the 
immunogenic protein, or a recombinantly expressed immunogenic protein. Furthermore, 

5 the protein may be conjugated to a second protein known to be immunogenic in the 

mammal being immunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 
trypsin inhibitor. The preparation can further include an adjuvant. Various adjuvants used 
to increase the immunological response include, but are not limited to, Freund's (complete 

10 and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (eg., 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc), 
adjuvants usable in humans such as Bacille Calmette-Guerin and Corynebacterium 
parvum, or similar immunostimulatory agents. Additional examples of adjuvants which 
can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 

1 5 trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can 
be isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 

20 antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffinity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. 
Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
(April 17, 2000), pp. 25-28). 

25 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", 
as used herein, refers to a population of antibody molecules that contain only one 
molecular species of antibody molecule consisting of a unique light chain gene product 
and a unique heavy chain gene product. In particular, the complementarity determining 
30 regions (CDRs) of the monoclonal antibody are identical in all the molecules of the 



76 



WO 03/080856 



PCT/US03/08490 



population. MAbs thus contain an antigen binding site capable of immunoreacting with a 
particular epitope of the antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a 

5 mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing 
antibodies that specifically bind to the immunizing agent. Alternatively, the lymphocytes 
can be immunized in vitro. 

The immunizing agent typically includes the protein antigen, a fragment thereof or 

10 a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells 
of human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized 
cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma 
cell (Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 

1 5 59-103). Immortalized cell lines are usually transformed mammalian cells, particularly 
myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell 
lines are employed. The hybridoma cells can be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the 
unfused, immortalized cells. For example, if the parental cells lack the enzyme 

20 hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically includes hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high 
level expression of antibody by the selected antibody-producing cells, and are sensitive to 

25 a medium such as HAT medium. More preferred immortalized cell lines are murine 
myeloma lines, which can be obtained, for instance, from the Salk Institute Cell 
Distribution Center, San Diego, California and the American Type Culture Collection, 
Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also 
have been described for the production of human monoclonal antibodies (Kozbor, J. 

30 Immunol., 1 33:3001 (1984); Brodeur et al., Monoclonal Antibody Production Techniques 
and Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63). 
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The culture medium in which the hybridoma cells are cultured can then be assayed 
for the presence of monoclonal antibodies directed against the antigen. Preferably, the 
binding specificity of monoclonal antibodies produced by the hybridoma cells is 
determined by immunoprecipitation or by an in vitro binding assay, such as 

5 radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such 
techniques and assays are known in the art The binding affinity of the monoclonal 
antibody can, for example, be determined by the Scatchard analysis of Munson and 
Pollard, Anal. Biochem., 107:220 (1980). It is an objective, especially important in 
therapeutic applications of monoclonal antibodies, to identify antibodies having a high 

10 degree of specificity and a high binding affinity for the target antigen. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods (Goding,1986). Suitable 
culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium 
and RPMI-1640 medium. Alternatively, the hybridoma cells can be grown in vivo as 

1 5 ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified 
from the culture medium or ascites fluid by conventional immunoglobulin purification 
procedures such as, for example, protein A-Sepharose, hydroxylapatite chromatography, 
gel electrophoresis, dialysis, or affinity chromatography. 

20 The monoclonal antibodies can also be made by recombinant DNA methods, such 

as those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal 
antibodies of the invention can be readily isolated and sequenced using conventional 
procedures (e.g., by using oligonucleotide probes that are capable of binding specifically 
to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells 

25 of the invention serve as a preferred source of such DNA. Once isolated, the DNA can be 
placed into expression vectors, which are then transfected into host cells such as simian 
COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise 
produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also can be modified, for example, by substituting the 

30 coding sequence for human heavy and light chain constant domains in place of the 

homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 812-13 

(1994)) or by coyalently joining to the immunoglobulin coding sequence all or part of the 
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coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or 
can be substituted for the variable domains of one antigen-combining site of an antibody 
of the invention to create a chimeric bivalent antibody. 

5 Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against 
the administered immunoglobulin. Humanized forms of antibodies are chimeric 

10 immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab 1 , 

F(ab f )2 or other antigen-binding subsequences of antibodies) that are principally comprised 
of the sequence of a human immunoglobulin, and contain minimal sequence derived from 
a non-human immunoglobulin. Humanization can be performed following the method of 
Winter and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., 

15 Nature, 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by 

substituting rodent CDRs or CDR sequences for the corresponding sequences of a human 
antibody. (See also U.S. Patent No. 5,225,539.) In some instances, Fv framework 
residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies can also comprise residues which are found neither in the 

20 recipient antibody nor in the imported CDR or framework sequences. In general, the 

humanized antibody comprises substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a 
non-human immunoglobulin and all or substantially all of the framework regions are those 
of a human immunoglobulin consensus sequence. The humanized antibody optimally also 

25 comprises at least a portion of an immunoglobulin constant region (Fc), typically that of a 
human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. 
Struct. Biol., 2:593-596 (1992)). 

Human Antibodies 

Fully human antibodies essentially relate to antibody molecules in which the entire 
30 sequence of both the light chain and the heavy chain, including the CDRs, arise from 
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human genes. Such antibodies are termed "human antibodies", or "fully human 
antibodies" herein. Human monoclonal antibodies can be prepared by the trioma 
technique; the human B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol 
Today 4: 72) and the EBV hybridoma technique to produce human monoclonal antibodies 

5 (see Cole, et al., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the practice of the 
present invention and may be produced by using human hybridomas (see Cote, et al., 
1983. Proc Natl Acad Sci USA 80: 2026-2030) or by transforming human B-cells with 
Epstein Barr Virus in vitro (see Cole, et al., 1985 In: MONOCLONAL ANTIBODIES AND 

10 Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); 
Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 

1 5 endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 

20 779-783 (1992)); Lonberg et al. (Nature 368 856-859 (1994)); Morrison ( Nature 368, 
812-13 (1994)); Fishwild et al,( Nature Biotechnology 14, 845-51 (1996)); Neuberger 
(Nature Biotechnology 14, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 
13 65-93 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman 

25 animals which are modified so as to produce fully human antibodies rather than the 

animal's endogenous antibodies in response to challenge by an antigen. (See PCT 

publication WO94/02602). The endogenous genes encoding the heavy and light 

immunoglobulin chains in the nonhuman host have been incapacitated, and active loci 

encoding human heavy and light chain immunoglobulins are inserted into the host's 

30 genome. The human genes are incorporated, for example, using yeast artificial 

chromosomes containing the requisite human DNA segments. An animal which provides 

all the desired modifications is then obtained as progeny by crossbreeding intermediate 
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transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the 
Xenomouse™ as disclosed in PCT publications WO 96/33735 and WO 96/34096. This 
animal produces B cells which secrete fully human immunoglobulins. The antibodies can 

5 be obtained directly from the animal after immunization with an immunogen of interest, 
as, for example, a preparation of a polyclonal antibody, or alternatively from immortalized 
B cells derived from the animal, such as hybridomas producing monoclonal antibodies. 
Additionally, the genes encoding the immunoglobulins with human variable regions can 
be recovered and expressed to obtain the antibodies directly, or can be further modified to 

10 obtain analogs of antibodies such as, for example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 
Patent No. 5,939,598. It can be obtained by a method including deleting the J segment 
genes from at least one endogenous heavy chain locus in an embryonic stem cell to 

1 5 prevent rearrangement of the locus and to prevent formation of a transcript of a rearranged 
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector 
containing a gene encoding a selectable marker, and producing from the embryonic stem 
cell a transgenic mouse whose somatic and germ cells contain the gene encoding the 
selectable marker. 

20 A method for producing an antibody of interest, such as a human antibody, is 

disclosed in U.S. Patent No. 5,916,771. It includes introducing an expression vector that 
contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 
culture, introducing an expression vector containing a nucleotide sequence encoding a 
light chain into another mammalian host cell, and fusing the two cells to form a hybrid 

25 cell. The hybrid cell expresses an antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically 
relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 
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F ab Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see eg., U.S. 
Patent No. 4,946,778). In addition, methods can be adapted for the construction of F ab 

5 expression libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid 
and effective identification of monoclonal Fab fragments with the desired specificity for a 
protein or derivatives, fragments, analogs or homologs thereof. Antibody fragments that 
contain the idiotypes to a protein antigen may be produced by techniques known in the art 
including, but not limited to: (i) an F^ fragment produced by pepsin digestion of an 

10 antibody molecule; (ii) an Fab fragment generated by reducing the disulfide bridges of an 
F(ab , )2 fragment; (iii) an F a b fragment generated by the treatment of the antibody molecule 
with papain and a reducing agent and (iv) F v fragments. 

Bispecifie Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
15 that have binding specificities for at least two different antigens. In the present case, one 
of the binding specificities is for an antigenic protein of the invention. The second binding 
target is any other antigen, and advantageously is a cell-surface protein or receptor or 
receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
20 recombinant production of bispecific antibodies is based on the co-expression of two 

immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
produce a potential mixture of ten different antibody molecules, of which only one has the 
25 correct bispecific structure. The purification of the correct molecule is usually 

accomplished by affinity chromatography steps. Similar procedures are disclosed in WO 
93/08829, published 13 May 1993, and in Traunecker et al., EMBO J., 10:3655-3659 
(1991). 

Antibody variable domains with the desired binding specificities 
30 (antibody-antigen combining sites) can be fused to immunoglobulin constant domain 
sequences. The fusion preferably is with an immunoglobulin heavy-chain constant 
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domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to 
have the first heavy-chain constant region (CHI) containing the site necessary for 
light-chain binding present in at least one of the fusions. DNAs encoding the 
immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin light chain, are 

5 inserted into separate expression vectors, and are co-transfected into a suitable host 

organism. For further details of generating bispecific antibodies see, for example, Suresh 
et al., Methods in Enzymology, 121:210 (1986). 

According to another approach described in WO 96/2701 1 , the interface between a 
pair of antibody molecules can be engineered to maximize the percentage of heterodimers 

10 which are recovered from recombinant cell culture. The preferred interface comprises at 
least a part of the CH3 region of an antibody constant domain. In this method, one or 
more small amino acid side chains from the interface of the first antibody molecule are 
replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of 
identical or similar size to the large side chain(s) are created on the interface of the second 

15 antibody molecule by replacing large amino acid side chains with smaller ones (e.g. 
alanine or threonine). This provides a mechanism for increasing the yield of the 
heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody 
fragments (e.g. F(ab') 2 bispecific antibodies). Techniques for generating bispecific 

20 antibodies from antibody fragments have been described in the literature. For example, 
bispecific antibodies can be prepared using chemical linkage. Brennan et al., Science 
229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to 
generate F(ab')2 fragments. These fragments are reduced in the presence of the dithiol 
complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular 

25 disulfide formation. The Fab' fragments generated are then converted to thionitrobenzoate 
(TNB) derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol 
by reduction with mercaptoethylamine and is mixed with an equimolar amount of the 
other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies 
produced can be used as agents for the selective immobilization of enzymes. 

30 Additionally, Fab' fragments can be directly recovered from E. coli and chemically 

coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) 

describe the production of a fully humanized bispecific antibody F(ab*)2 molecule. Each 
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Fab' fragment was separately secreted from E. coli and subjected to directed chemical 
coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well 
as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor 
5 targets. 

Various techniques for making and isolating bispecific antibody fragments 
directly from recombinant cell culture have also been described. For example, bispecific 
antibodies have been produced using leucine zippers. Kostelny et al., J. Immunol. 
148(5): 1547-1 553 (1992). The leucine zipper peptides from the Fos and Jun proteins were 

10 linked to the Fab' portions of two different antibodies by gene fusion. The antibody 

homodimers were reduced at the hinge region to form monomers and then re-oxidized to 
form the antibody heterodimers. This method can also be utilized for the production of 
antibody homodimers. The "diabody" technology described by Hollinger et al., Proc. 
Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for 

1 5 making bispecific antibody fragments. The fragments comprise a heavy-chain variable 
domain (V H ) connected to a light-chain variable domain (V L ) by a linker which is too short 
to allow pairing between the two domains on the same chain. Accordingly, the Vh and Vl 
domains of one fragment are forced to pair with the complementary Vl and Vh domains of 
another fragment, thereby forming two antigen-binding sites. Another strategy for making 

20 bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, 
trispecific antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 

25 which originates in the protein antigen of the invention. Alternatively, an anti-antigenic 
arm of an immunoglobulin molecule can be combined with an arm which binds to a 
triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, 
CD28, or B7), or Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and 
FcyRIIl (CD16) so as to focus cellular defense mechanisms to the cell expressing the 

30 particular antigen. Bispecific antibodies can also be used to direct cytotoxic agents to cells 

which express a particular antigen. These antibodies possess an antigen-binding arm and 

an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, 
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DOTA, or TETA. Another bispecific antibody of interest binds the protein antigen 
described herein and further binds tissue factor (TF). 

Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 

5 Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
antibodies have, for example, been proposed to target immune system cells to unwanted 
cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro 
using known methods in synthetic protein chemistry, including those involving 

10 crosslinking agents. For example, immunotoxins can be constructed using a disulfide 
exchange reaction or by forming a thioether bond. Examples of suitable reagents for this 
purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for 
example, in U.S. Patent No. 4,676,980. 

Effector Function Engineering 

1 5 It can be desirable to modify the antibody of the invention with respect to effector 

function, so as to enhance, eg., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing 
interchain disulfide bond formation in this region. The homodimeric antibody thus 
generated can have unproved internalization capability and/or increased 

20 complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). 
See Caron et al., J. Exp Med., 176: 1 191-1 195 (1992) and Shopes, J. Immunol, 148: 
2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity can also be 
prepared using heterobifunctional cross-linkers as described in Wolff et al. Cancer 
Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that has 

25 dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody 
conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an 
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enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments 
thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have 
been described above. Enzymatically active toxins and fragments thereof that can be used 

5 include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A 
chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, 
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 
(PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the 

10 tricothecenes. A variety of radionuclides are available for the production of 
radioconjugated antibodies. Examples include 212 Bi, 131 1, 131 In, 90 Y, and 186 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) 
propionate (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as 

1 5 dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes 
(such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 
2,6-diisocyanate), and bis-active fluorine compounds (such as 

20 1 ,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as 
described in Vitetta et al., Science. 238 : 1098 (1987). Carbon- 14-labeled 
l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an 
exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

25 In another embodiment, the antibody can be conjugated to a "receptor" (such 

streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate 
is administered to the patient, followed by removal of unbound conjugate from the 
circulation using a clearing agent and then administration of a "ligand" (e.g., avidin) that is 
in turn conjugated to a cytotoxic agent. 
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Immunoliposomes 

The antibodies disclosed herein can also be formulated as immunoliposomes. 
Liposomes containing the antibody are prepared by methods known in the art, such as 
described in Epstein et al., Proc. Natl. Acad. Sci. USA, 82: 3688 (1985); Hwang et al., 

5 Proc. Natl Acad. Sci. USA, 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. 
Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation 
method with a lipid composition comprising phosphatidylcholine, cholesterol, and 
PEG-derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through 

10 filters of defined pore size to yield liposomes with the desired diameter. Fab 1 fragments of 
the antibody of the present invention can be conjugated to the liposomes as described in 
Martin et al .J. Biol. Chem., 257: 286-288 (1982) via a disulfide-interchange reaction. A 
chemotherapeutic agent (such as Doxorubicin) is optionally contained within the 
liposome. See Gabizon et al, J. National Cancer Inst., 81(19): 1484 (1989). 

1 5 Diagnostic Applications of Antibodies Directed Against the Proteins of the 

Invention 

In one embodiment, methods for the screening of antibodies that possess the 
desired specificity include, but are not limited to, enzyme linked immunosorbent assay 
(ELISA) and other immunologically mediated techniques known within the art. In a 
20 specific embodiment, selection of antibodies that are specific to a particular domain of an 
NOVX protein is facilitated by generation of hybridomas that bind to the fragment of an 
NOVX protein possessing such a domain. Thus, antibodies that are specific for a desired 
domain within an NOVX protein, or derivatives, fragments, analogs or homologs thereof, 
are also provided herein. 

25 Antibodies directed against a NOVX protein of the invention may be used in 

methods known within the art relating to the localization and/or quantitation of a NOVX 
protein (eg., for use in measuring levels of the NOVX protein within appropriate 
physiological samples, for use in diagnostic methods, for use in imaging the protein, and 
the like). In a given embodiment, antibodies specific to a NOVX protein, or derivative, 

30 fragment, analog or homolog thereof, that contain the antibody derived antigen binding 
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domain, are utilized as pharmacologically active compounds (referred to hereinafter as 
"Therapeutics"). 

An antibody specific for a NOVX protein of the invention (e.g., a monoclonal 
antibody or a polyclonal antibody) can be used to isolate a NOVX polypeptide by standard 

5 techniques, such as immunoaffinity, chromatography or immunoprecipitation. An 

antibody to a NOVX polypeptide can facilitate the purification of a natural NOVX antigen 
from cells, or of a recombinantly produced NOVX antigen expressed in host cells. 
Moreover, such an anti-NOVX antibody can be used to detect the antigenic NOVX protein 
(e.g., in a cellular lysate or cell supernatant) in order to evaluate the abundance and pattern 

1 0 of expression of the antigenic NOVX protein. Antibodies directed against a NOVX 
protein can be used diagnostically to monitor protein levels in tissue as part of a clinical 
testing procedure, eg., to, for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling (ie., physically linking) the antibody to 
a detectable substance. Examples of detectable substances include various enzymes, 

1 5 prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials, 
and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, p-galactosidase, or acetylcholinesterase; examples of suitable 
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein 

20 isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material includes luminol; examples of 
bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include 125 I, l3l I, 35 S or 3 H. 

Antibody Therapeutics 

25 Antibodies of the invention, including polyclonal, monoclonal, humanized and 

fully human antibodies, may used as therapeutic agents. Such agents are generally 
employed to treat or prevent a disease or pathology in a subject An antibody preparation, 
preferably one having high specificity and high affinity for its target antigen, is 
administered to the subject and generally has an effect due to its binding with the target. 

30 Such an effect may be one of two kinds, depending on the specific nature of the interaction 
between the given antibody molecule and the target antigen in question. In the first 
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instance, administration of the antibody may abrogate or inhibit the binding of the target 
with an endogenous ligand to which it naturally binds. In this case, the antibody binds to 
the target and masks a binding site of the naturally occurring ligand, wherein the ligand 
serves as an effector molecule. Thus the receptor mediates a signal transduction pathway 

5 for which ligand is responsible. 

Alternatively, the effect may be one in which the antibody elicits a physiological 
result by virtue of binding to an effector binding site on the target molecule. In this case 
the target, a receptor having an endogenous ligand which may be absent or defective in the 
disease or pathology, binds the antibody as a surrogate effector ligand, initiating a 

1 0 receptor-based signal transduction event by the receptor. 

A therapeutically effective amount of an antibody of the invention relates generally 
to the amount needed to achieve a therapeutic objective. As noted above, this may be a 
binding interaction between the antibody and its target antigen that, in certain cases, 
interferes with the functioning of the target, and in other cases, promotes a physiological 

1 5 response. The amount required to be administered will furthermore depend on the binding 
affinity of the antibody for its specific antigen, and will also depend on the rate at which 
an administered antibody is depleted from the free volume other subject to which it is 
administered. Common ranges for therapeutically effective dosing of an antibody or 
antibody fragment of the invention may be, by way of nonlimiting example, from about 

20 0.1 mg/kg body weight to about 50 mg/kg body weight Common dosing frequencies may 
range, for example, from twice daily to once a week. 

Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a protein of the invention, as well as other 
molecules identified by the screening assays disclosed herein, can be administered for the 

25 treatment of various disorders in the form of pharmaceutical compositions. Principles and 
considerations involved in preparing such compositions, as well as guidance in the choice 
of components are provided, for example, in Remington : The Science And Practice Of 
Pharmacy 19th ed. (Alfonso R. Gennaro, et al., editors) Mack Pub. Co., Easton, Pa. : 1995; 
Drug Absorption Enhancement : Concepts, Possibilities, Limitations, And Trends, 

30 Harwood Academic Publishers, Langhorne, Pa., 1994; and Peptide And Protein Drug 
Delivery (Advances In Parenteral Sciences, Vol. 4), 1991, M. Dekker, New York. 
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If the antigenic protein is intracellular and whole antibodies are used as inhibitors, 
internalizing antibodies are preferred. However, liposomes can also be used to deliver the 
antibody, or an antibody fragment, into cells. Where antibody fragments are used, the 
smallest inhibitory fragment that specifically binds to the binding domain of the target 

5 protein is preferred. For example, based upon the variable-region sequences of an 
antibody, peptide molecules can be designed that retain the ability to bind the target 
protein sequence. Such peptides can be synthesized chemically and/or produced by 
recombinant DNA technology. See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA, 90: 
7889-7893 (1993). The formulation herein can also contain more than one active 

10 compound as necessary for the particular indication being treated, preferably those with 
complementary activities that do not adversely affect each other. Alternatively, or in 
addition, the composition can comprise an agent that enhances its function, such as, for 
example, a cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. 
Such molecules are suitably present in combination in amounts that are effective for the 

1 5 purpose intended. 

The active ingredients can also be entrapped in microcapsules prepared, for 
example, by coacervation techniques or by interfacial polymerization, for example, 
hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 

20 albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

The formulations to be used for in vivo administration must be sterile. This is 
readily accomplished by filtration through sterile filtration membranes. 

Sustained-release preparations can be prepared. Suitable examples of 

25 sustained-release preparations include semipermeable matrices of solid hydrophobic 

polymers containing the antibody, which matrices are in the form of shaped articles, e.g., 
films, or microcapsules. Examples of sustained-release matrices include polyesters, 
hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinylalcohol)), 
polylactides (U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y 

30 ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid-glycolic 

acid copolymers such as the LUPRON DEPOT ™ (injectable microspheres composed of 

lactic acid-glycolic acid copolymer and leuprolide acetate), and 
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poly-D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and 
lactic acid-glycolic acid enable release of molecules for over 100 days, certain hydrogels 
release proteins for shorter time periods. 

ELISA Assay 

5 An agent for detecting an analyte protein is an antibody capable of binding to an 

analyte protein, preferably an antibody with a detectable label. Antibodies can be 
polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof 
(e.g., F ab or F (ab )2) can be used. The term "labeled", with regard to the probe or antibody, 
is intended to encompass direct labeling of the probe or antibody by coupling 

10 physically linking) a detectable substance to the probe or antibody, as well as indirect 

labeling of the probe or antibody by reactivity with another reagent that is directly labeled. 
Examples of indirect labeling include detection of a primary antibody using a 
fluorescently-labeled secondary antibody and end-labeling of a DNA probe with biotin 
such that it can be detected with fluorescently-labeled streptavidin. The term "biological 

1 5 sample" is intended to include tissues, cells and biological fluids isolated from a subject, 
as well as tissues, cells and fluids present within a subject. Included within the usage of 
the term "biological sample", therefore, is blood and a fraction or component of blood 
including blood serum, blood plasma, or lymph. That is, the detection method of the 
invention can be used to detect an analyte mRNA, protein, or genomic DNA in a 

20 biological sample in vitro as well as in vivo. For example, in vitro techniques for detection 
of an analyte mRNA include Northern hybridizations and in situ hybridizations. In vitro 
techniques for detection of an analyte protein include enzyme linked immunosorbent 
assays (ELISAs), Western blots, immunopiecipitations, and immunofluorescence. In vitro 
techniques for detection of an analyte genomic DNA include Southern hybridizations. 

25 Procedures for conducting immunoassays are described, for example in "ELISA: Theory 
and Practice: Methods in Molecular Biology", Vol. 42, J. R. Crowther (Ed.) Human Press, 
Totowa, NJ, 1995; "Immunoassay", E. Diamandis and T. Christopoulus, Academic Press, 
Inc., San Diego, CA, 1996; and "Practice and Theory of Enzyme Immunoassays", P. 
Tijssen, Elsevier Science Publishers, Amsterdam, 1985. Furthermore, in vivo techniques 

30 for detection of an analyte protein include introducing into a subject a labeled anti-an 
analyte protein antibody. For example, the antibody can be labeled with a radioactive 
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marker whose presence and location in a subject can be detected by standard imaging 
techniques. 

NOVX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 

5 containing a nucleic acid encoding a NOVX protein, or derivatives, fragments, analogs or 
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, wherein 

10 additional DNA segments can be ligated into the viral genome. Certain vectors are 
capable of autonomous replication in a host cell into which they are introduced (e.g., 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated along 

15 with the host genome. Moreover, certain vectors are capable of directing the expression of 
genes to which they are operatively-linked. Such vectors are referred to herein as 
"expression vectors". In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids. In the present specification, "plasmid" and 
"vector" can be used interchangeably as the plasmid is the most commonly used form of 

20 vector. However, the invention is intended to include such other forms of expression 
vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and 
adeno-associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means 

25 that the recombinant expression vectors include one or more regulatory sequences, 

selected on the basis of the host cells to be used for expression, that is operatively-linked 
to the nucleic acid sequence to be expressed. Within a recombinant expression vector, 
"operably-linked" is intended to mean that the nucleotide sequence of interest is linked to 
the regulatory sequence(s) in a manner that allows for expression of the nucleotide 

30 sequence (eg., in an in vitro transcription/translation system or in a host cell when the 
vector is introduced into the host cell). 
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The term "regulatory sequence" is intended to includes promoters, enhancers and 
other expression control elements (e.g., polyadenylation signals). Such regulatory 
sequences are described, for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: 
METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory 

5 sequences include those that direct constitutive expression of a nucleotide sequence in 
many types of host cell and those that direct expression of the nucleotide sequence only in 
certain host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by 
those skilled in the art that the design of the expression vector can depend on such factors 
as the choice of the host cell to be transformed, the level of expression of protein desired, 

10 etc. The expression vectors of the invention can be introduced into host cells to thereby 
produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic 
acids as described herein (e.g., NOVX proteins, mutant forms of NOVX proteins, fusion 
proteins, etc.). 

The recombinant expression vectors of the invention can be designed for 

15 expression of NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX 
proteins can be expressed in bacterial cells such as Escherichia coli, insect cells (using 
baculo virus expression vectors) yeast cells or mammalian cells. Suitable host cells are 
discussed further in Goeddel, Gene Expression Technology: Methods in 
ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Alternatively, the 

20 recombinant expression vector can be transcribed and translated in vitro, for example 
using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in Escherichia coli 
with vectors containing constitutive or inducible promoters directing the expression of 
either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a 

25 protein encoded therein, usually to the amino terminus of the recombinant protein. Such 
fusion vectors typically serve three purposes: (0 to increase expression of recombinant 
protein; (n) to increase the solubility of the recombinant protein; and (hi) to aid in the 
purification of the recombinant protein by acting as a ligand in affinity purification. 
Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction 

30 of the fusion moiety and the recombinant protein to enable separation of the recombinant 

protein from the fusion moiety subsequent to purification of the fusion protein. Such 

enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and 
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enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; 
Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, 
Mass.) and pRIT5 (Pharmacia, Piscataway, NJ.) that fuse glutathione S-transferase 
(GST), maltose E binding protein, or protein A, respectively, to the target recombinant 
5 protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amrann e/ a/., (1988) Gene 69:301-315) andpET lld(Studieref a/., GENE EXPRESSION 
TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 
60-89). 

10 One strategy to maximize recombinant protein expression in E. coli is to express 

the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein. See, e.g., Gottesman, GENE EXPRESSION TECHNOLOGY: METHODS IN 
ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 1 19-128. Another strategy 
is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression 

15 vector so that the individual codons for each amino acid are those preferentially utilized in 
E. coli {see, e.g., Wada, et a/., 1992. Nucl. Acids Res. 20: 2111-2118). Such alteration of 
nucleic acid sequences of the invention can be carried out by standard DNA synthesis 
techniques. 

In another embodiment, the NOVX expression vector is a yeast expression vector. 

20 Examples of vectors for expression in yeast Saccharomyces cerivisae include p YepSec 1 

(Baldari, et al. y 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 

30: 933-943), pJRY88 (Schultz et a/., 1987. Gene 54: 113-123), pYES2 (Invitrogen 

Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). 

Alternatively, NOVX can be expressed in insect cells using baculovirus expression 

25 vectors. Baculovirus vectors available for expression of proteins in cultured insect cells 

(eg., SF9 cells) include the pAc series (Smith, et al. t 1983. Mol Cell Biol. 3: 2156-2165) 

and the pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 

mammalian cells using a mammalian expression vector. Examples of mammalian 

30 expression vectors include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC 

(Kaufinan, et al. f \9X1. EMBOJ. 6: 187-195). When used in mammalian cells, the 

expression vector's control functions are often provided by viral regulatory elements. For 
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example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus, and simian virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al. 9 
Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Haibor 

5 Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable 

10 tissue-specific promoters include the albumin promoter (liver-specific; Pinkert, et a/., 
1987. Genes Dev. 1: 268-277), lymphoid-specific promoters (Calame and Eaton, 1988. 
Adv. Immunol. 43: 235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore, 1989. EMBOJ. 8: 729-733) and immunoglobulins (Banerji, et al. y 1983. Cell 
33: 729-740; Queen and Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters 

15 (e.g., the neurofilament promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86: 
5473-5477), pancreas-specific promoters (Edlund, et al. 9 1985. Science 230: 912-916), and 
mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 
and European Application Publication No. 264,166). Developmentally-regulated 
promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. 

20 Science 249: 374-379) and the a-fetoprotein promoter (Campes and Tilghman, 1 989. 
Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. 
That is, the DNA molecule is operatively-linked to a regulatory sequence in a manner that 

25 allows for expression (by transcription of the DNA molecule) of an RNA molecule that is 
antisense to NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid 
cloned in the antisense orientation can be chosen that direct the continuous expression of 
the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen that direct constitutive, tissue specific or 

30 cell type specific expression of antisense RNA. The antisense expression vector can be in 
the form of a recombinant plasmid, phagemid or attenuated virus in which antisense 
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nucleic acids are produced under the control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into which the vector is introduced. 
For a discussion of the regulation of gene expression using antisense genes see, e.g., 
Weintraub, et al., "Antisense RNA as a molecular tool for genetic analysis," 

5 Reviews-Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
refer not only to the particular subject cell but also to the progeny or potential progeny of 

10 such a cell. Because certain modifications may occur in succeeding generations due to 
either mutation or environmental influences, such progeny may not, in fact, be identical to 
the parent cell, but are still included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein 
can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells 

1 5 (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are 
known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 

20 techniques for introducing foreign nucleic acid (e.g. , DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al. (MOLECULAR CLONING: A 
LABORATORY Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 

25 Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g. t resistance to antibiotics) is 

30 generally introduced into the host cells along with the gene of interest. Various selectable 

markers include those that confer resistance to drugs, such as G418, hygromycin and 

methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell 
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on the same vector as that encoding NOVX or can be introduced on a separate vector. 
Cells stably transfected with the introduced nucleic acid can be identified by drug 
selection (eg., cells that have incorporated the selectable marker gene will survive, while 
the other cells die). 

5 A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 

can be used to produce (i.e., express) NOVX protein. Accordingly, the invention further 
provides methods for producing NOVX protein using the host cells of the invention. In 
one embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding NOVX protein has been introduced) in a suitable 

10 medium such that NOVX protein is produced. In another embodiment, the method further 
comprises isolating NOVX protein from the medium or the host cell. 

Transgenic NOVX Animals 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 

1 5 oocyte or an embryonic stem cell into which NOVX protein-coding sequences have been 
introduced. Such host cells can then be used to create non-human transgenic animals in 
which exogenous NOVX sequences have been introduced into their genome or 
homologous recombinant animals in which endogenous NOVX sequences have been 
altered. Such animals are useful for studying the function and/or activity of NOVX 

20 protein and for identifying and/or evaluating modulators of NOVX protein activity. As 
used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more 
preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal 
includes a transgene. Other examples of transgenic animals include non-human primates, 
sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that 

25 is integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous NOVX gene has been altered 

30 by homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior 
to development of the animal. 

A transgenic animal of the invention can be created by introducing 
NOVX-encoding nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by 
5 microinjection, retroviral infection) and allowing the oocyte to develop in a 

pseudopregnant female foster animal. The human NOVX cDNA sequences, &&, any one 
of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, can be introduced as a 
transgene into the genome of a non-human animal. Alternatively, a non-human 
homologue of the human NOVX gene, such as a mouse NOVX gene, can be isolated 

10 based on hybridization to the human NOVX cDNA (described further supra) and used as a 
transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific 
regulatory sequence(s) can be operably-linked to the NOVX transgene to direct expression 
of NOVX protein to particular cells. Methods for generating transgenic animals via 

15 embryo manipulation and microinjection, particularly animals such as mice, have become 
conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866; 
4,870,009; and 4,873,191; and Hogan, 1986. In: Manipulating the Mouse Embryo, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. Similar methods are 
used for production of other transgenic animals. A transgenic founder animal can be 

20 identified based upon the presence of the NOVX transgene in its genome and/or 

expression of NOVX mRNA in tissues or cells of the animals. A transgenic founder 
animal can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene-encoding NOVX protein can further be bred to 
other transgenic animals carrying other transgenes. 

25 To create a homologous recombinant animal, a vector is prepared which contains 

at least a portion of a NOVX gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g. , functionally disrupt, the NOVX gene. The NOVX gene 
can be a human gene (e.g., the cDNA of any one of SEQ ID NO:2n-l, wherein n is an 
integer between 1 and 12), but more preferably, is a non-human homologue of a human 

30 NOVX gene. For example, a mouse homologue of human NOVX gene of SEQ ID 

NO:2/i-l, wherein n is an integer between 1 and 12, can be used to construct a 

homologous recombination vector suitable for altering an endogenous NOVX gene in the 
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mouse genome. In one embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous NOVX gene is functionally disrupted (i.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous 

5 recombination, the endogenous NOVX gene is mutated or otherwise altered but still 
encodes functional protein (e.g., the upstream regulatory region can be altered to thereby 
alter the expression of the endogenous NOVX protein). In the homologous recombination 
vector, the altered portion of the NOVX gene is flanked at its 5'- and S'-termini by 
additional nucleic acid of the NOVX gene to allow for homologous recombination to 

1 0 occur between the exogenous NOVX gene carried by the vector and an endogenous 

NOVX gene in an embryonic stem cell. The additional flanking NOVX nucleic acid is of 
sufficient length for successful homologous recombination with the endogenous gene. 
Typically, several kilobases of flanking DNA (both at the 5'- and 3'-termini) are included 
in the vector. See, e.g., Thomas, et al., 1987. Cell 51: 503 for a description of homologous 

15 recombination vectors. The vector is ten introduced into an embryonic stem cell line (e.g., 
by electroporation) and cells in which the introduced NOVX gene has 
homologously-recombined with the endogenous NOVX gene are selected. See, e.g., Li, et 
at., 1992. Cell 69: 915. 

The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to 

20 form aggregation chimeras. See, e.g., Bradley, 1987. In: TERATOCARCINOMAS AND 
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 
1 13-152. A chimeric embryo can then be implanted into a suitable pseudopregnant female 
foster animal and the embryo brought to term. Progeny harboring the 
homologously-recombined DNA in their germ cells can be used to breed animals in which 

25 all cells of the animal contain the homologously-recombined DNA by germline 

transmission of the transgene. Methods for constructing homologous recombination 
vectors and homologous recombinant animals are described further in Bradley, 1991 . 
Curr. Opin. Biotechnol 2: 823-829; PCT International Publication Nos.: WO 90/1 1354; 
WO 91/01 140; WO 92/0968; and WO 93/04169. 

30 In another embodiment, transgenic non-humans animals can be produced that 

contain selected systems that allow for regulated expression of the transgene. One 

example of such a system is the cre/loxP recombinase system of bacteriophage PI . For a 
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description of the cre/loxP recombinase system, See, e.g., Lakso, et a/., 1992. Proc. Natl 
Acad. ScL USA 89: 6232-6236. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae. See, O'Gorman, et a/., 1991 . Science 
251 : 1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
5 transgene, animals containing transgenes encoding both the Cre recombinase and a 

selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

1 0 Clones of the non-human transgenic animals described herein can also be produced 

according to the methods described in Wilmut, et a/., 1997. Nature 385: 810-813. In brief, 
a cell (eg., a somatic cell) from the transgenic animal can be isolated and induced to exit 
the growth cycle and enter G 0 phase. The quiescent cell can then be fused, eg., through 
the use of electrical pulses, to an enucleated oocyte from an animal of the same species 

1 5 from which the quiescent cell is isolated. The reconstructed oocyte is then cultured such 
that it develops to morula or blastocyte and then transferred to pseudopregnant female 
foster animal. The offspring borne of this female foster animal will be a clone of the 
animal from which the cell (eg., the somatic cell) is isolated. 

Pharmaceutical Compositions 

20 The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies 

(also referred to herein as "active compounds") of the invention, and derivatives, 
fragments, analogs and homologs thereof, can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a pharmaceutical^ acceptable carrier. As 

25 used herein, "pharmaceutically acceptable carrier" is intended to include any and all 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with pharmaceutical administration. 
Suitable carriers are described in the most recent edition of Remington's Pharmaceutical 
Sciences, a standard reference text in the field, which is incorporated herein by reference. 

30 Preferred examples of such carriers or diluents include, but are not limited to, water, 
saline, finger's solutions, dextrose solution, and 5% human serum albumin. Liposomes 
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and non-aqueous vehicles such as fixed oils may also be used. The use of such media and 
agents for pharmaceutical^ active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof in 
the compositions is contemplated. Supplementary active compounds can also be 

5 incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral, e.g. 9 intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(i.e., topical), transmucosal, and rectal administration. Solutions or suspensions used for 

10 parenteral, intradermal, or subcutaneous application can include the following 

components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid (EDTA); 

IS buffers such as acetates, citrates or phosphates, and agents for the adjustment of tonicity 
such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in 
ampoules, disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 

20 solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ 
(BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the 
composition must be sterile and should be fluid to the extent that easy syringeability 

25 exists. It must be stable under the conditions of manufacture and storage and must be 

preserved against the contaminating action of microorganisms such as bacteria and fungi. 
The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the 
like), and suitable mixtures thereof. The proper fluidity can be maintained, for example, 

30 by the use of a coating such as lecithin, by the maintenance of the required particle size in 
the case of dispersion and by the use of surfactants. Prevention of the action of 
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microorganisms can be achieved by various antibacterial and antifungal agents, for 
example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many 
cases, it is be preferable to include isotonic agents, for example, sugars, polyalcohols such 
as manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the 

5 injectable compositions can be brought about by including in the composition an agent 
which delays absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(eg., a NOVX protein or anti-NOVX antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, followed 

10 by filtered sterilization. Generally, dispersions are prepared by incorporating the active 
compound into a sterile vehicle that contains a basic dispersion medium and the required 
other ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, methods of preparation are vacuum drying and 
freeze-drying that yields a powder of the active ingredient plus any additional desired 

1 5 ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 

20 using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically compatible 
binding agents, and/or adjuvant materials can be included as part of the composition. The 
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth 

25 or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic 
acid, Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; 
or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 

30 aerosol spray from pressured container or dispenser which contains a suitable propellant, 
a gas such as carbon dioxide, or a nebulizer. 
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Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
5 fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 

1 0 conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that protect 
the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 

15 biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 

polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation 
of such formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 

20 viral antigens) can also be used as pharmaceutically acceptable carriers. These can be 

prepared according to methods known to those skilled in the art, for example, as described 
in U.S. Patent No. 4,522,811. 

It is especially advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as used 

25 herein refers to physically discrete units suited as unitary dosages for the subject to be 
treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the particular 

30 therapeutic effect to be achieved, and the limitations inherent in the art of compounding 
such an active compound for the treatment of individuals. 
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The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470) 
or by stereotactic injection (see, e.g. y Chen, et aL 9 1994. Proc. Natl Acad. ScL USA 91: 

5 3054-3057). The pharmaceutical preparation of the gene therapy vector can include the 
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the 
pharmaceutical preparation can include one or more cells that produce the gene delivery 

10 system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX 

15 protein (e.g., via a recombinant expression vector in a host cell in gene therapy 

applications), to detect NOVX mRNA (e.g., in a biological sample) or a genetic lesion in a 
NOVX gene, and to modulate NOVX activity, as described further, below. In addition, 
the NOVX proteins can be used to screen drugs or compounds that modulate the NOVX 
protein activity or expression as well as to treat disorders characterized by insufficient or 

20 excessive production of NOVX protein or production of NOVX protein forms that have 
decreased or aberrant activity compared to NOVX wild-type protein (e.g. \ diabetes 
(regulates insulin release); obesity (binds and transport lipids); metabolic disturbances 
associated with obesity, the metabolic syndrome X as well as anorexia and wasting 
disorders associated with chronic diseases and various cancers, and infectious 

25 disease(possesses anti-microbial activity) and the various dyslipidemias. In addition, the 
anti-NOVX antibodies of the invention can be used to detect and isolate NOVX proteins 
and modulate NOVX activity. In yet a further aspect, the invention can be used in methods 
to influence appetite, absorption of nutrients and the disposition of metabolic substrates in 
both a positive and negative fashion. 

30 The invention further pertains to novel agents identified by the screening assays 

described herein and uses thereof for treatments as described, supra. 
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Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") 
for identifying modulators, i.e., candidate or test compounds or agents (eg., peptides, 
peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a 
5 stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein 
activity. The invention also includes compounds identified in the screening assays 
described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of a 

10 NOVX protein or polypeptide or biologically-active portion thereof. The test compounds 
of the invention can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 

1 5 methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 
non-peptide oligomer or small molecule libraries of compounds. See, e.g., Lam, 1997. 
Anticancer Drug Design 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a 

20 molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 
molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, 
carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical 
and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art 
and can be screened with any of the assays of the invention. 

25 Examples of methods for the synthesis of molecular libraries can be found in the 

art, for example in: DeWitt, et al y 1993. Proc. Natl. Acad. ScL U.S.A. 90: 6909; Erb, et a/., 
1994. Proc. Natl. Acad. Sci. U.S.A. 91: 1 1422; Zuckermann, et al. 9 1994. J. Med. Chem. 
37: 2678; Cho,e/a/., 1993. Science 261: 1303; Carrell, etal., l994.Angew. Chem.Int.Ed. 
Engl. 33: 2059; Carell, et al. f 1994. Angew. Chem. Int. Ed. Engl. 33: 2061; and Gallop, et 

30 al. 9 1994. J. Med. Chem. 37: 1233. 
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Libraries of compounds may be presented in solution {e.g., Houghten, 1992. 
Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips 
(Fodor, 1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores 
(Ladner, U.S. Patent 5,233,409), plasmids (Cull, et al y 1 992. Proc. Natl Acad. Sci. USA 
5 89: 1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. 
Science 249: 404-406; Cwirla, et al, 1990. Proc. Natl Acad Set USA. 87: 6378-6382; 
Felici, 1991.7. Mol Biol 222: 301-310; Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses 
a membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the 

1 0 cell surface is contacted with a test compound and the ability of the test compound to bind 
to a NOVX protein determined. The cell, for example, can of mammalian origin or a yeast 
cell. Determining the ability of the test compound to bind to the NOVX protein can be 
accomplished, for example, by coupling the test compound with a radioisotope or 
enzymatic label such that binding of the test compound to the NOVX protein or 

1 5 biologically-active portion thereof can be determined by detecting the labeled compound 
in a complex. For example, test compounds can be labeled with 125 1, 35 S, 14 C, or 3 H, either 
directly or indirectly, and the radioisotope detected by direct counting of radioemission or 
by scintillation counting. Alternatively, test compounds can be enzymatically-labeled 
with, for example, horseradish peroxidase, alkaline phosphatase, or luciferase, and the 

20 enzymatic label detected by determination of conversion of an appropriate substrate to 
product. In one embodiment, the assay comprises contacting a cell which expresses a 
membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the 
cell surface with a known compound which binds NOVX to form an assay mixture, 
contacting the assay mixture with a test compound, and determining the ability of the test 

25 compound to interact with a NOVX protein, wherein determining the ability of the test 
compound to interact with a NOVX protein comprises determining the ability of the test 
compound to preferentially bind to NOVX protein or a biologically-active portion thereof 
as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 

30 expressing a membrane-bound form of NOVX protein, or a biologically-active portion 

thereof, on the cell surface with a test compound and determining the ability of the test 

compound to modulate (e.g., stimulate or inhibit) the activity of the NOVX protein or 
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biologically-active portion thereof Determining the ability of the test compound to 
modulate the activity of NOVX or a biologically-active portion thereof can be 
accomplished, for example, by determining the ability of the NOVX protein to bind to or 
interact with a NOVX target molecule. As used herein, a "target molecule" is a molecule 

5 with which a NOVX protein binds or interacts in nature, for example, a molecule on the 
surface of a cell which expresses a NOVX interacting protein, a molecule on the surface of 
a second cell, a molecule in the extracellular milieu, a molecule associated with the 
internal surface of a cell membrane or a cytoplasmic molecule. A NOVX target molecule 
can be a non-NOVX molecule or a NOVX protein or polypeptide of the invention. In one 

10 embodiment, a NOVX target molecule is a component of a signal transduction pathway 
that facilitates transduction of an extracellular signal (e.g. a signal generated by binding of 
a compound to a membrane-bound NOVX molecule) through the cell membrane and into 
the cell. The target, for example, can be a second intercellular protein that has catalytic 
activity or a protein that facilitates the association of downstream signaling molecules with 

15 NOVX. 

Determining the ability of the NOVX protein to bind to or interact with a NOVX 
target molecule can be accomplished by one of the methods described above for 
determining direct binding. In one embodiment, determining the ability of the NOVX 
protein to bind to or interact with a NOVX target molecule can be accomplished by 

20 determining the activity of the target molecule. For example, the activity of the target 
molecule can be determined by detecting induction of a cellular second messenger of the 
target (i.e. intracellular Ca 2+ , diacylglycerol, IP3, etc.), detecting catalytic/enzymatic 
activity of the target an appropriate substrate, detecting the induction of a reporter gene 
(comprising a NOVX-responsive regulatory element operatively linked to a nucleic acid 

25 encoding a detectable marker, e.g., luciferase), or detecting a cellular response, for 
example, cell survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay 
comprising contacting a NOVX protein or biologically-active portion thereof with a test 
compound and determining the ability of the test compound to bind to the NOVX protein 

30 or biologically-active portion thereof. Binding of the test compound to the NOVX protein 

can be determined either directly or indirectly as described above. In one such 

embodiment, the assay comprises contacting the NOVX protein or biologically-active 
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portion thereof with a known compound which binds NOVX to form an assay mixture, 
contacting the assay mixture with a test compound, and determining the ability of the test 
compound to interact with a NOVX protein, wherein determining the ability of the test 
compound to interact with a NOVX protein comprises determining the ability of the test 

5 compound to preferentially bind to NOVX or biologically-active portion thereof as 
compared to the known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting 
NOVX protein or biologically-active portion thereof with a test compound and 
determining the ability of the test compound to modulate (eg. stimulate or inhibit) the 

1 0 activity of the NOVX protein or biologically-active portion thereof. Determining the 
ability of the test compound to modulate the activity of NOVX can be accomplished, for 
example, by determining the ability of the NOVX protein to bind to a NOVX target 
molecule by one of the methods described above for determining direct binding. In an 
alternative embodiment, determining the ability of the test compound to modulate the 

1 5 activity of NOVX protein can be accomplished by determining the ability of the NOVX 
protein further modulate a NOVX target molecule. For example, the catalytic/enzymatic 
activity of the target molecule on an appropriate substrate can be determined as described, 
supra. 

In yet another embodiment, the cell-free assay comprises contacting the NOVX 
20 protein or biologically-active portion thereof with a known compound which binds NOVX 
protein to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with a NOVX protein, wherein 
determining the ability of the test compound to interact with a NOVX protein comprises 
determining the ability of the NOVX protein to preferentially bind to or modulate the 
25 activity of a NOVX target molecule. 

The cell-free assays of the invention are amenable to use of both the soluble form 
or the membrane-bound form of NOVX protein. In the case of cell-free assays comprising 
the membrane-bound form of NOVX protein, it may be desirable to utilize a solubilizing 
agent such that the membrane-bound form of NOVX protein is maintained in solution. 
30 Examples of such solubilizing agents include non-ionic detergents such as 

n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 
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Isotridecypoly(ethylene glycol ether) n , N-dodecyl--N,N-dimethyl-3-ammonio-l -propane 
sulfonate, 3-(3-cholamidopropyl) dimethylamminiol-1 -propane sulfonate (CHAPS), or 
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it may 

5 be desirable to immobilize either NOVX protein or its target molecule to facilitate 

separation of complexed from uncomplexed forms of one or both of the proteins, as well 
as to accommodate automation of the assay. Binding of a test compound to NOVX 
protein, or interaction of NOVX protein with a target molecule in the presence and 
absence of a candidate compound, can be accomplished in any vessel suitable for 

10 containing the reactants. Examples of such vessels include micro titer plates, test tubes, 
and micro-centrifuge tubes. In one embodiment, a fusion protein can be provided that 
adds a domain that allows one or both of the proteins to be bound to a matrix. For 
example, GST-NO VX fusion proteins or GST-target fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized 

1 5 microtiter plates, that are then combined with the test compound or the test compound and 
either the non-adsorbed target protein or NOVX protein, and the mixture is incubated 
under conditions conducive to complex formation (eg., at physiological conditions for salt 
and pH). Following incubation, the beads or microtiter plate wells are washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex 

20 determined either directly or indirectly, for example, as described, supra. Alternatively, 
the complexes can be dissociated from the matrix, and the level of NOVX protein binding 
or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the NOVX protein or its target 

25 molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
NOVX protein or target molecules can be prepared from biotin-NHS 
(N-hydroxy-succinimide) using techniques well-known within the art (eg., biotinylation 
kit, Pierce Chemicals, Rockford, 111.), and immobilized in the wells of streptavidin-coated 
96 well plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein 

30 or target molecules, but which do not interfere with binding of the NOVX protein to its 
target molecule, can be derivatized to the wells of the plate, and unbound target or NOVX 
protein trapped in the wells by antibody conjugation. Methods for detecting such 
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complexes, in addition to those described above for the GST-immobilized complexes, 
include immunodetection of complexes using antibodies reactive with the NOVX protein 
or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic 
activity associated with the NOVX protein or target molecule. 

5 In another embodiment, modulators of NOVX protein expression are identified in 

a method wherein a cell is contacted with a candidate compound and the expression of 
NOVX mRNA or protein in the cell is determined. The level of expression of NOVX 
mRNA or protein in the presence of the candidate compound is compared to the level of 
expression of NOVX mRNA or protein in the absence of the candidate compound. The 

10 candidate compound can then be identified as a modulator of NOVX mRNA or protein 
expression based upon this comparison. For example, when expression of NOVX mRNA 
or protein is greater (i.e., statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 
NOVX mRNA or protein expression. Alternatively, when expression of NOVX mRNA or 

15 protein is less (statistically significantly less) in the presence of the candidate compound 
than in its absence, the candidate compound is identified as an inhibitor of NOVX mRNA 
or protein expression. The level of NOVX mRNA or protein expression in the cells can be 
determined by methods described herein for detecting NOVX mRNA or protein. 

In yet another aspect of the invention, the NOVX proteins can be used as "bait 

20 proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,3 17; 
Zervos, et al. 9 1993. Cell 72: 223-232; Madura, et a/., 1993. J. Biol Chem. 268: 
12046-12054; Bartel, etaL, 1993. Biotechniques 14: 920-924; Iwabuchi, etal, 1993. 
Oncogene 8: 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to 
or interact with NOVX ("NOVX-binding proteins" or "NOVX-bp") and modulate NOVX 

25 activity. Such NOVX-binding proteins are also involved in the propagation of signals by 
the NOVX proteins as, for example, upstream or downstream elements of the NOVX 
pathway. 

The two-hybrid system is based on the modular nature of most transcription 

factors, which consist of separable DNA-binding and activation domains. Briefly, the 

30 assay utilizes two different DNA constructs. In one construct, the gene that codes for 

NOVX is fused to a gene encoding the DNA binding domain of a known transcription 

factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library of DNA 
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sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a gene that 
codes for the activation domain of the known transcription factor. If the "bait" and the 
"prey" proteins are able to interact, in vivo, forming a NOVX-dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 

5 proximity. This proximity allows transcription of a reporter gene (eg., LacZ) that is 
operably linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected and cell colonies containing the functional 
transcription factor can be isolated and used to obtain the cloned gene that encodes the 
protein which interacts with NOVX. 

10 The invention further pertains to novel agents identified by the aforementioned 

screening assays and uses thereof for treatments as described herein. 

Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as polynucleotide 
1 5 reagents. By way of example, and not of limitation, these sequences can be used to: (i) 
map their respective genes on a chromosome; and, thus, locate gene regions associated 
with genetic disease; (if) identify an individual from a minute biological sample (tissue 
typing); and (Hi) aid in forensic identification of a biological sample. Some of these 
applications are described in the subsections, below. 

20 Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the NOVX sequences 
of SEQ ID NO:2n-l, wherein n is an integer between 1 and 12, or fragments or derivatives 

25 thereof, can be used to map the location of the NOVX genes, respectively, on a 

chromosome. The mapping of the NOVX sequences to chromosomes is an important first 
step in correlating these sequences with genes associated with disease. 

Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the NOVX sequences. Computer analysis of the 

30 NOVX, sequences can be used to rapidly select primers that do not span more than one 
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exon in the genomic DNA, thus complicating the amplification process. These primers 
can then be used for PCR screening of somatic cell hybrids containing individual human 
chromosomes. Only those hybrids containing the human gene corresponding to the 
NOVX sequences will yield an amplified fragment. 

5 Somatic cell hybrids are prepared by fusing somatic cells from different mammals 

(e.g. 9 human and mouse cells). As hybrids of human and mouse cells grow and divide, 
they gradually lose human chromosomes in random order, but retain the mouse 
chromosomes. By using media in which mouse cells cannot grow, because they lack a 
particular enzyme, but in which human cells can, the one human chromosome that 

10 contains the gene encoding the needed enzyme will be retained. By using various media, 
panels of hybrid cell lines can be established. Each cell line in a panel contains either a 
single human chromosome or a small number of human chromosomes, and a full set of 
mouse chromosomes, allowing easy mapping of individual genes to specific human 
chromosomes. See, e.g„ D f Eustachio, et a/., 1983. Science 220: 919-924. Somatic cell 

1 5 hybrids containing only fragments of human chromosomes can also be produced by using 
human chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day 
using a single thermal cycler. Using the NOVX sequences to design oligonucleotide 

20 primers, sub-localization can be achieved with panels of fragments from specific 
chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
step. Chromosome spreads can be made using cells whose division has been blocked in 

25 metaphase by a chemical like colcemid that disrupts the mitotic spindle. The 

chromosomes can be treated briefly with trypsin, and then stained with Giemsa. A pattern 
of light and dark bands develops on each chromosome, so that the chromosomes can be 
identified individually. The FISH technique can be used with a DNA sequence as short as 
500 or 600 bases. However, clones larger than 1 ,000 bases have a higher likelihood of 

30 binding to a unique chromosomal location with sufficient signal intensity for simple 

detection. Preferably 1,000 bases, and more preferably 2,000 bases, will suffice to get 

good results at a reasonable amount of time. For a review of this technique, see, Verma, et 
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al. y Human Chromosomes: A Manual of Basic Techniques (Pergamon Press, New 
York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 

S marking multiple sites and/or multiple chromosomes. Reagents corresponding to 
noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the chance 
of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 

10 position of the sequence on the chromosome can be correlated with genetic map data. 
Such data are found, e.g., in McKusick, Mendelian Inheritance in Man, available 
on-line through Johns Hopkins University Welch Medical Library). The relationship 
between genes and disease, mapped to the same chromosomal region, can then be 
identified through linkage analysis (co-inheritance of physically adjacent genes), described 

1 5 in, e.g., Egeland, et a/., 1987. Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the NOVX gene, can be determined. If a 
mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 

20 Comparison of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes, such as deletions or translocations that are 
visible from chromosome spreads or detectable using PGR based on that DNA sequence. 
Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

25 Tissue Typing 

The NOVX sequences of the invention can also be used to identify individuals 
from minute biological samples. In this technique, an individuals genomic DNA is 
digested with one or more restriction enzymes, and probed on a Southern blot to yield 
unique bands for identification. The sequences of the invention are useful as additional 
30 DNA markers for RFLP ("restriction fragment length polymorphisms," described in U.S. 
Patent No. 5,272,057). 
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Furthermore, the sequences of the invention can be used to provide an alternative 
technique that determines the actual base-by-base DNA sequence of selected portions of 
an individual's genome. Thus, the NOVX sequences described herein can be used to 
prepare two PCR primers from the 5'- and 3 -termini of the sequences. These primers can 

5 then be used to amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this 
manner, can provide unique individual identifications, as each individual will have a 
unique set of such DNA sequences due to allelic differences. The sequences of the 
invention can be used to obtain such identification sequences from individuals and from 

10 tissue. The NOVX sequences of the invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the coding regions of these sequences, 
and to a greater degree in the noncoding regions. It is estimated that allelic variation 
between individual humans occurs with a frequency of about once per each 500 bases. 
Much of the allelic variation is due to single nucleotide polymorphisms (SNPs), which 

1 5 include restriction fragment length polymorphisms (RFLPs). 

Each of the sequences described herein can, to some degree, be used as a standard 
against which DNA from an individual can be compared for identification purposes. 
Because greater numbers of polymorphisms occur in the noncoding regions, fewer 
sequences are necessary to differentiate individuals. The noncoding sequences can 

20 comfortably provide positive individual identification with a panel of perhaps 10 to 1,000 
primers that each yield a noncoding amplified sequence of 100 bases. If coding 
sequences, such as those of SEQ ID NO:2/i-l , wherein n is an integer between 1 and 12, 
are used, a more appropriate number of primers for positive individual identification 
would be 500-2,000. 

25 Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 
Accordingly, one aspect of the invention relates to diagnostic assays for determining 
30 NOVX protein and/or nucleic acid expression as well as NOVX activity, in the context of 
a biological sample (e.g., blood, serum, cells, tissue) to thereby determine whether an 
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individual is afflicted with a disease or disorder, or is at risk of developing a disorder, 
associated with aberrant NOVX expression or activity. The disorders include metabolic 
disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated cachexia, 
cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 

5 disorders, and hematopoietic disorders, and the various dyslipidemias, metabolic 

disturbances associated with obesity, the metabolic syndrome X and wasting disorders 
associated with chronic diseases and various cancers. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is at risk of 
developing a disorder associated with NOVX protein, nucleic acid expression or activity. 

10 For example, mutations in a NOVX gene can be assayed in a biological sample. Such 

assays can be used for prognostic or predictive purpose to thereby prophylactically treat an 
individual prior to the onset of a disorder characterized by or associated with NOVX 
protein, nucleic acid expression, or biological activity. 

Another aspect of the invention provides methods for determining NOVX protein, 

1 S nucleic acid expression or activity in an individual to thereby select appropriate 
therapeutic or prophylactic agents for that individual (referred to herein as 
"pharmacogenomics"). Pharmacogenomics allows for the selection of agents (e.g., drugs) 
for therapeutic or prophylactic treatment of an individual based on the genotype of the 
individual (eg., the genotype of the individual examined to determine the ability of the 

20 individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs, compounds) on the expression or activity of NOVX in clinical trials. 

These and other agents are described in further detail in the following sections. 

Diagnostic Assays 

25 An exemplaiy method for detecting the presence or absence of NOVX in a 

biological sample involves obtaining a biological sample from a test subject and 
contacting the biological sample with a compound or an agent capable of detecting NOVX 
protein or nucleic acid (eg., mRNA, genomic DNA) that encodes NOVX protein such that 
the presence of NOVX is detected in the biological sample. An agent for detecting NOVX 

30 mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to NOVX 
mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length 
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NOVX nucleic acid, such as the nucleic acid of SEQ ID NO:2/i-l , wherein n is an integer 
between 1 and 12, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 
100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 
stringent conditions to NOVX mRNA or genomic DNA. Other suitable probes for use in 

5 the diagnostic assays of die invention are described herein. 

An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or 
more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or 
F(ab t >2) can be used. The term "labeled", with regard to the probe or antibody, is intended 

10 to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 
labeling include detection of a primary antibody using a fluorescently-labeled secondary 
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 

15 fluorescently-labeled streptavidin. The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a subject, as well as tissues, cells and 
fluids present within a subject. That is, the detection method of the invention can be used 
to detect NOVX mRNA, protein, or genomic DNA in a biological sample in vitro as well 
as in vivo. For example, in vitro techniques for detection of NOVX mRNA include 

20 Northern hybridizations and in situ hybridizations. In vitro techniques for detection of 
NOVX protein include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations, and immunofluorescence. In vitro techniques for detection of 
NOVX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques 
for detection of NOVX protein include introducing into a subject a labeled anti-NOVX 

25 antibody. For example, the antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is 

30 a peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 

sample from a control subject, contacting the control sample with a compound or agent 
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capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of 
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and 
comparing the presence of NOVX protein, mRNA or genomic DNA in the control sample 
with the presence of NOVX protein, mRNA or genomic DNA in the test sample. 

5 The invention also encompasses kits for detecting the presence of NOVX in a 

biological sample. For example, the kit can comprise: a labeled compound or agent 
capable of detecting NOVX protein or mRNA in a biological sample; means for 
determining the amount of NOVX in the sample; and means for comparing the amount of 
NOVX in the sample with a standard. The compound or agent can be packaged in a 

1 0 suitable container. The kit can further comprise instructions for using the kit to detect 
NOVX protein or nucleic acid. 

Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 

1 5 NOVX expression or activity. For example, the assays described herein, such as the 
preceding diagnostic assays or the following assays, can be utilized to identify a subject 
having or at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. Alternatively, the prognostic assays can be utilized to identify a 
subject having or at risk for developing a disease or disorder. Thus, the invention provides 

20 a method for identifying a disease or disorder associated with aberrant NOVX expression 
or activity in which a test sample is obtained from a subject and NOVX protein or nucleic 
acid (e.g>, mRNA, genomic DNA) is detected, wherein the presence of NOVX protein or 
nucleic acid is diagnostic for a subject having or at risk of developing a disease or disorder 
associated with aberrant NOVX expression or activity. As used herein, a "test sample" 

25 refers to a biological sample obtained from a subject of interest For example, a test 
sample can be a biological fluid (e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) to 

30 treat a disease or disorder associated with aberrant NOVX expression or activity. For 
example, such methods can be used to determine whether a subject can be effectively 
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treated with an agent for a disorder. Thus, the invention provides methods for determining 
whether a subject can be effectively treated with an agent for a disorder associated with 
aberrant NOVX expression or activity in which a test sample is obtained and NOVX 
protein or nucleic acid is detected (e.g., wherein the presence of NOVX protein or nucleic 

5 acid is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with aberrant NOVX expression or activity). 

The methods of the invention can also be used to detect genetic lesions in a NOVX 
gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In various 

10 embodiments, the methods include detecting, in a sample of cells from the subject, the 
presence or absence of a genetic lesion characterized by at least one of an alteration 
affecting the integrity of a gene encoding a NOVX-protein, or the misexpression of the 
NOVX gene. For example, such genetic lesions can be detected by ascertaining the 
existence of at least one of: (i) a deletion of one or more nucleotides from a NOVX gene; 

15 (ii) an addition of one or more nucleotides to a NOVX gene; (Hi) a substitution of one or 
more nucleotides of a NOVX gene, (iv) a chromosomal rearrangement of a NOVX gene; 
(v) an alteration in the level of a messenger RNA transcript of a NOVX gene, (vi) aberrant 
modification of a NOVX gene, such as of the methylation pattern of the genomic DNA, 
(v/i) the presence of a non-wild-type splicing pattern of a messenger RNA transcript of a 

20 NOVX gene, (v/ii) a non-wild-type level of a NOVX protein, (ix) allelic loss of a NOVX 
gene, and (x) inappropriate post-translational modification of a NOVX protein. As 
described herein, there are a large number of assay techniques known in the art which can 
be used for detecting lesions in a NOVX gene. A preferred biological sample is a 
peripheral blood leukocyte sample isolated by conventional means from a subject. 

25 However, any biological sample containing nucleated cells may be used, including, for 
example, buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer 
in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain 

30 reaction (LCR) (see, e.g., Landegran, et ai, 1988. Science 241 : 1077-1080; and 

Nakazawa, et al y 1994. Proc. Natl Acad. Set USA 91: 360-364), the latter of which can 

be particularly useful for detecting point mutations in the NOVX-gene (see, Abravaya, et 
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a/., 1995. Nucl. Acids Res. 23: 675-682). This method can include the steps of collecting a 
sample of cells from a patient, isolating nucleic acid (eg., genomic, mRNA or both) from 
the cells of the sample, contacting the nucleic acid sample with one or more primers that 
specifically hybridize to a NOVX gene under conditions such that hybridization and 

5 amplification of the NOVX gene (if present) occurs, and detecting the presence or absence 
of an amplification product, or detecting the size of the amplification product and 
comparing the length to a control sample. It is anticipated that PCR and/or LCR may be 
desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

1 0 Alternative amplification methods include: self sustained sequence replication (see, 

Guatelli, et aL, 1990. Proc. Natl. Acad. Set USA 87: 1874-1878), transcriptional 
amplification system (see, Kwoh, et ai 9 1989. Proc. Natl. Acad. Sci. USA 86: 1 173-1 177); 
QP Replicase (see, Lizardi, et aU 1988. BioTechnology 6: 1 197), or any other nucleic acid 
amplification method, followed by the detection of the amplified molecules using 

15 techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 

In an alternative embodiment, mutations in a NOVX gene from a sample cell can 
be identified by alterations in restriction enzyme cleavage patterns. For example, sample 

20 and control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and 
compared. Differences in fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use of sequence specific 
ribozymes (see, e.g., U.S. Patent No. 5,493,531) can be used to score for the presence of 

25 specific mutations by development or loss of a ribozyme cleavage site. 

In other embodiments, genetic mutations in NOVX can be identified by 
hybridizing a sample and control nucleic acids, e.g. f DNA or RNA, to high-density arrays 
containing hundreds or thousands of oligonucleotides probes. See, e.g., Cronin, et al. 9 
1996. Human Mutation 7: 244-255; Kozal, et al. 9 1996. Nat. Med. 2: 753-759. For 

30 example, genetic mutations in NOVX can be identified in two dimensional arrays 

containing light-generated DNA probes as described in Cronin, et al. y supra. Briefly, a 
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first hybridization array of probes can be used to scan through long stretches of DNA in a 
sample arid control to identify base changes between the sequences by making linear 
arrays of sequential overlapping probes. This step allows the identification of point 
mutations. This is followed by a second hybridization array that allows the 
5 characterization of specific mutations by using smaller, specialized probe arrays 

complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other complementary 
to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the 

10 art can be used to directly sequence the NOVX gene and detect mutations by comparing 
the sequence of the sample NOVX with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on techniques developed by Maxim 
and Gilbert, 1977. Proc. Natl. Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl Acad. 
ScL USA 74: 5463. It is also contemplated that any of a variety of automated sequencing 

15 procedures can be utilized when performing the diagnostic assays (see, e.g., Naeve, et al. 9 
1995. Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT 
International Publication No. WO 94/16101; Cohen, et al., 1996. Adv. Chromatography 
36: 127-162; and Griffin, et ai, 1993. Appl Biochem. Biotechnol. 38: 147-159). 

Other methods for detecting mutations in the NOVX gene include methods in 

20 which protection from cleavage agents is used to detect mismatched bases in RNA/RNA 
orRNA/DNAheteroduplexes. See, e.g., Myers, etal. y 1985. Science 230: 1242. In 
general, the art technique of "mismatch cleavage" starts by providing heteroduplexes of 
formed by hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence 
with potentially mutant RNA or DNA obtained from a tissue sample. The double-stranded 

25 duplexes are treated with an agent that cleaves single-stranded regions of the duplex such 
as may exist due to basepair mismatches between the control and sample strands. For 
instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids treated 
with Sj nuclease to enzyraatically digesting the mismatched regions. In other 
embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 

30 hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 

regions. After digestion of the mismatched regions, the resulting material is then 

separated by size on denaturing polyacrylamide gels to determine the site of mutatioa 

120 . 



WO 03/080856 



PCT/US03/08490 



See, e.g., Cotton, et al t 1988. Proc. Natl Acad. Sci. USA 85: 4397; Saleeba, et al t 1992. 
Methods Enzymol 217: 286-295. In an embodiment, the control DNA or RNA can be 
labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 

5 proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations 
in NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. 
coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells 
cleaves T at G/T mismatches. See, e.g., Hsu, et a/., 1994. Carcinogenesis 15: 1657-1662. 

1 0 According to an exemplary embodiment, a probe based on a NOVX sequence, e.g. , a 
wild-type NOVX sequence, is hybridized to a cDNA or other DNA product from a test 
cell(s). The duplex is treated with a DNA mismatch repair enzyme, and the cleavage 
products, if any, can be detected from electrophoresis protocols or the like. See, e.g., U.S. 
Patent No. 5,459,039. 

1 5 In other embodiments, alterations in electrophoretic mobility can be used to 

identify mutations in NOVX genes. For example, single strand conformation 
polymorphism (SSCP) may be used to detect differences in electrophoretic mobility 
between mutant and wild type nucleic acids. See, e.g., Orita, et a/., 1 989. Proc. Natl 
Acad. Sci. USA: 86: 2766; Cotton, 1993. Mutat Res. 285: 125-144; Hayashi, 1992. Genet. 

20 Anal Tech. Appl 9: 73-79. Single-stranded DNA fragments of sample and control NOVX 
nucleic acids will be denatured and allowed to renature. The secondary structure of 
single-stranded nucleic acids varies according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a single base change. The DNA 
fragments can be labeled or detected with labeled probes. The sensitivity of the assay can 

25 be enhanced by using RNA (rather than DNA), in which the secondary structure is more 
sensitive to a change in sequence. In one embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility. See, e.g., Keen, et al, 1991. Trends Genet. 7: 5. 
In yet another embodiment, the movement of mutant or wild-type fragments in 

30 polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE). See, e.g., Myers, et al t 1985. Nature 313: 495. 
When DGGE is used as the method of analysis, DNA will be modified to insure that it 
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does not completely denature, for example by adding a GC clamp of approximately 40 bp 
of high-melting GC-rich DNA by PCR In a further embodiment, a temperature gradient 
is used in place of a denaturing gradient to identify differences in the mobility of control 
and sample DNA. See, e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753. 

5 Examples of other techniques for detecting point mutations include, but are not 

limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers can be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
that permit hybridization only if a perfect match is found. See, e.g. t Saiki, et aL, 1986. 

10 Nature 324: 163; Saiki, et aL, 1989. Proc. Natl. Acad ScL USA 86: 6230. Such allele 
specific oligonucleotides are hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are attached to the hybridizing membrane 
and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective 

1 5 PCR amplification can be used in conjunction with the instant invention. Oligonucleotides 
used as primers for specific amplification can carry the mutation of interest in the center of 
the molecule (so that amplification depends on differential hybridization; see, e.g., Gibbs, 
et aL, 1989. NucL Acids Res. 17: 2437-2448) or at the extreme 3-terminus of one primer 
where, under appropriate conditions, mismatch can prevent, or reduce polymerase 

20 extension (see, e.g., Prossner, 1993. Tibtech. 11: 238). In addition it may be desirable to 
introduce a novel restriction site in the region of the mutation to create cleavage-based 
detection. See, e.g., Gasparini, et aL, 1992. MoL Cell Probes 6: 1. It is anticipated that in 
certain embodiments amplification may also be performed using Taq ligase for 
amplification. See, e.g., Barany, 1991. Proc. Natl. Acad. Sci. USA 88: 189. In such cases, 

25 ligation will occur only if there is a perfect match at the 3 '-terminus of the 5 f sequence, 
making it possible to detect the presence of a known mutation at a specific site by looking 
for the presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing 
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody 

30 reagent described herein, which may be conveniently used, e.g. , in clinical settings to 

diagnose patients exhibiting symptoms or family history of a disease or illness involving a 
NOVX gene. 
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Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in 
which NO VX is expressed can be utilized in the prognostic assays described herein. 
However, any biological sample containing nucleated cells may be used, including, for 
example, buccal mucosal cells. 

5 Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NOVX 
activity (e.g., NOVX gene expression), as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically) disorders. 
The disorders include but are not limited to, e.g., those diseases, disorders and conditions 

10 listed above, and more particularly include those diseases, disorders, or conditions 
associated with homologs of a NOVX protein, such as those summarized in Table A. 

In conjunction with such treatment, the pharmacogenomics (i.e., the study of the 
relationship between an individual's genotype and that individual's response to a foreign 
compound or drug) of the individual may be considered. Differences in metabolism of 

1 5 therapeutics can lead to severe toxicity or therapeutic failure by altering the relation 
between dose and blood concentration of the pharmacologically active drug. Thus, the 
pharmacogenomics of the individual permits the selection of effective agents drugs) 
for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Such pharmacogenomics can further be used to determine appropriate dosages 

20 and therapeutic regimens. Accordingly, the activity of NOVX protein, expression of 
NOVX nucleic acid, or mutation content of NOVX genes in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic treatment 
of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
25 response to drugs due to altered drug disposition and abnormal action in affected persons. 
See e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol. Physiol., 23: 983-985; Linder, 1997. 
Clin. Chem., 43: 254-266. In general, two types of pharmacogenetic conditions can be 
differentiated. Genetic conditions transmitted as a single factor altering the way drugs act 
on the body (altered drug action) or genetic conditions transmitted as single factors 
30 altering the way the body acts on drugs (altered drug metabolism). These 

pharmacogenetic conditions can occur either as rare defects or as polymorphisms. For 
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example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited 
enzymopathy in which the main clinical complication is hemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of 
fava beans. 

5 As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome pregnancy zone protein precursor enzymes CYP2D6 and CYP2C19) 
has provided an explanation as to why some patients do not obtain the expected drug 

10 effects or show exaggerated drug response and serious toxicity after taking the standard 
and safe dose of a drug. These polymorphisms are expressed in two phenotypes in the 
population, the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence 
of PM is different among different populations. For example, the gene coding for 
CYP2D6 is highly polymorphic and several mutations have been identified in PM, which 

1 5 all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and 

CYP2C19 quite frequently experience exaggerated drug response and side effects when 
they receive standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by its 
CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra-rapid 

20 metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or 
mutation content of NOVX genes in an individual can be determined to thereby select 
appropriate agent(s) for therapeutic or prophylactic treatment of the individual. In 

25 addition, pharmacogenetic studies can be used to apply genotyping of polymorphic alleles 
encoding drug-metabolizing enzymes to the identification of an individual's drug 
responsiveness phenotype. This knowledge, when applied to dosing or drug selection, can 
avoid adverse reactions or therapeutic failure and thus enhance therapeutic or prophylactic 
efficiency when treating a subject with a NOVX modulator, such as a modulator identified 

30 by one of the exemplary screening assays described herein. 
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Monitoring of Effects During Clinical Trials 

» 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of NOVX (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical trials. 

5 For example, the effectiveness of an agent determined by a screening assay as described 
herein to increase NOVX gene expression, protein levels, or upregulate NOVX activity, 
can be monitored in clinical trails of subjects exhibiting decreased NOVX gene 
expression, protein levels, or downregulated NOVX activity. Alternatively, the 
effectiveness of an agent determined by a screening assay to decrease NOVX gene 

10 expression, protein levels, or downregulate NOVX activity, can be monitored in clinical 
trails of subjects exhibiting increased NOVX gene expression, protein levels, or 
upregulated NOVX activity. In such clinical trials, the expression or activity of NOVX 
and, preferably, other genes that have been implicated in, for example, a cellular 
proliferation or immune disorder can be used as a "read out" or markers of the immune 

1 5 responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including NOVX, that are 
modulated in cells by treatment with an agent (e.g., compound, drug or small molecule) 
that modulates NOVX activity (eg., identified in a screening assay as described herein) 
can be identified. Thus, to study the effect of agents on cellular proliferation disorders, for 

20 example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 
levels of expression of NOVX and other genes implicated in the disorder. The levels of 
gene expression (/.&, a gene expression pattern) can be quantified by Northern blot 
analysis or RT-PCR, as described herein, or alternatively by measuring the amount of 
protein produced, by one of the methods as described herein, or by measuring the levels of 

25 activity of NOVX or other genes. In this manner, the gene expression pattern can serve as 
a marker, indicative of the physiological response of the cells to the agent. Accordingly, 
this response state can be determined before, and at various points during, treatment of the 
individual with the agent. 

In one embodiment, the invention provides a method for monitoring the 

30 effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, 
peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate identified 
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by the screening assays described herein) comprising the steps of (0 obtaining a 
pre-administration sample from a subject prior to administration of the agent; («) detecting 
the level of expression of a NOVX protein, mRNA, or genomic DNA in the 
preadministration sample; (Hi) obtaining one or more post-administration samples from 

5 the subject; (iv) detecting the level of expression or activity of the NOVX protein, mRNA, 
or genomic DNA in the post-administration samples; (v) comparing the level of expression 
or activity of the NOVX protein, mRNA, or genomic DNA in the pre-administration 
sample with the NOVX protein, mRNA, or genomic DNA in the post administration 
sample or samples; and (vi) altering the administration of the agent to the subject 

10 accordingly. For example, increased administration of the agent may be desirable to 

increase the expression or activity of NOVX to higher levels than detected, i.e., to increase 
the effectiveness of the agent Alternatively, decreased administration of the agent may be 
desirable to decrease expression or activity of NOVX to lower levels than detected, i.e. y to 
decrease the effectiveness of the agent. 

1 5 Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with 
aberrant NOVX expression or activity. The disorders include but are not limited to, e.g., 
those diseases, disorders and conditions listed above, and more particularly include those 
20 diseases, disorders, or conditions associated with homologs of a NOVX protein, such as 
those summarized in Table A. 

These methods of treatment will be discussed more fully, below. 

Diseases and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
25 suffering from the disease or disorder) levels or biological activity can be treated with 

Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize 
activity can be administered in a therapeutic or prophylactic manner. Therapeutics that 
can be utilized include, but are not limited to: (i) an aforementioned peptide, or analogs, 
derivatives, fragments or homologs thereof; (if) antibodies to an aforementioned peptide; 
30 (Hi) nucleic acids encoding an aforementioned peptide; (iv) administration of antisense 
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nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion 
within the coding sequences of coding sequences to an aforementioned peptide) that are 
utilized to "knockout" endogenous function of an aforementioned peptide by homologous 
recombination (see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators ( 

S i.e. , inhibitors, agonists and antagonists, including additional peptide mimetic of the 
invention or antibodies specific to a peptide of the invention) that alter the interaction 
between an aforementioned peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity can be treated with 

10 Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate 
activity can be administered in a therapeutic or prophylactic manner. Therapeutics that 
can be utilized include, but are not limited to, an aforementioned peptide, or analogs, 
derivatives, fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide and/or 

1 5 RNA, by obtaining a patient tissue sample {e.g., from biopsy tissue) and assaying it in 
vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or 
mRNAs of an aforementioned peptide). Methods that are well-known within the art 
include, but are not limited to, immunoassays (e.g., by Western blot analysis, 
immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 

20 electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 

expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, and the like). 

Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a 
disease or condition associated with an aberrant NOVX expression or activity, by 

25 administering to the subject an agent that modulates NOVX expression or at least one 
NOVX activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NOVX expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 
agent can occur prior to the manifestation of symptoms characteristic of the NOVX 

30 aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending upon the type of NOVX aberrancy, for example, a NOVX 
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agonist or NOVX antagonist agent can be used for treating the subject. The appropriate 
agent can be determined based on screening assays described herein. The prophylactic 
methods of the invention are further discussed in the following subsections. 

Therapeutic Methods 

5 Another aspect of the invention pertains to methods of modulating NOVX 

expression or activity for therapeutic purposes. The modulatory method of the invention 
involves contacting a cell with an agent that modulates one or more of the activities of 
NOVX protein activity associated with the cell. An agent that modulates NOVX protein 
activity can be an agent as described herein, such as a nucleic acid or a protein, a 

1 0 naturally-occurring cognate ligand of a NOVX protein, a peptide, a NOVX 

peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or 
more NOVX protein activity. Examples of such stimulatory agents include active NOVX 
protein and a nucleic acid molecule encoding NOVX that has been introduced into the 
cell. In another embodiment, the agent inhibits one or more NOVX protein activity. 

1 5 Examples of such inhibitory agents include antisense NOVX nucleic acid molecules and 
anti-NOVX antibodies. These modulatory methods can be performed in vitro (eg., by 
culturing the cell with the agent) or, alternatively, in vivo (eg., by administering the agent 
to a subject). As such, the invention provides methods of treating an individual afflicted 
with a disease or disorder characterized by aberrant expression or activity of a NOVX 

20 protein or nucleic acid molecule. In one embodiment, the method involves administering 
an agent (eg., an agent identified by a screening assay described herein), or combination 
of agents that modulates (eg., up-regulates or down-regulates) NOVX expression or 
activity. In another embodiment, the method involves administering a NOVX protein or 
nucleic acid molecule as therapy to compensate for reduced or aberrant NOVX expression 

25 or activity. 

Stimulation of NOVX activity is desirable in s/fwations in which NOVX is 
abnormally downregulated and/or in which increased NOVX activity is likely to have a 
beneficial effect One example of such a situation is where a subject has a disorder 
characterized by aberrant cell proliferation and/or differentiation (eg., cancer or immune 
30 associated disorders). Another example of such a situation is where the subject has a 
gestational disease (eg., preclampsia). 
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Determination of the Biological Effect of the Therapeutic 

In various embodiments of the invention, suitable in vitro or in vivo assays are 
performed to determine the effect of a specific Therapeutic and whether its administration 
is indicated for treatment of the affected tissue. 

5 In various specific embodiments, in vitro assays can be performed with 

representative cells of the type(s) involved in the patient's disorder, to determine if a given 
Therapeutic exerts the desired effect upon the cell type(s). Compounds for use in therapy 
can be tested in suitable animal model systems including, but not limited to rats, mice, 
chicken, cows, monkeys, rabbits, and the like, prior to testing in human subjects. 

10 Similarly, for in vivo testing, any of the animal model system known in the art may be 
used prior to administration to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

The NOVX nucleic acids and proteins of the invention are useful in potential 
prophylactic and therapeutic applications implicated in a variety of disorders. The 

15 disorders include but are not limited to, e.g., those diseases, disorders and conditions listed 
above, and more particularly include those diseases, disorders, or conditions associated 
with homologs of a NOVX protein, such as those summarized in Table A. 

As an example, a cDNA encoding the NOVX protein of the invention may be 
useful in gene therapy, and the protein may be useful when administered to a subject in 

20 need thereof. By way of non-limiting example, the compositions of the invention will 
have efficacy for treatment of patients suffering from diseases, disorders, conditions and 
the like, including but not limited to those listed herein. 

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of 
the invention, or fragments thereof, may also be useful in diagnostic applications, wherein 

25 the presence or amount of the nucleic acid or the protein are to be assessed. A further use 
could be as an anti-bacterial molecule (i.e., some peptides have been found to possess 
anti-bacterial properties). These materials are further useful in the generation of 
antibodies, which immunospecifically-bind to the novel substances of the invention for 
use in therapeutic or diagnostic methods. 

30 
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The invention will be further described in the following examples, which do not 
limit the scope of the invention described in the claims. 

EXAMPLES 

Example 1. Molecular Cloning of CG57008 
5 The open reading frame of CG57008 (SEQ ID NO: 4) codes for a 359 amino acid 

long, Type I transmembrane protein with a predicted N-terminal signal sequence 
represented by the first 20 residues. The predicted transmembrane domain starts at residue 
287. 

Cloning the mature CG57008 

10 Oligonucleotide primers were designed to PCR amplify a DNA segment, 

representing an ORF, coding for the mature form of CG57008 (NOV1). The forward 
primer includes, a BamHI restriction site while the reverse primer contains an, in frame, 
Xhol restriction site for further subcloning purposes. The sequences of the primers are 
shown below: 

15 HAVcr-1 FORW: 

GGATCCTCTGTAAAGGTTGGTGGAGAGGCAGGTCC (SEQ ID NO:25), 
HAVcr-1 FL-REV: 

CTCGAGGTCCGTGGCATAAAGACTATTCTCAATG (SEQ ID NO:26). 

PCR reactions were set up using a total of 5 ng cDNA template containing equal 

20 parts of cDNA samples derived from human testis, human mammary, human skeletal 

muscle , and fetal brain; 1 microM of each of the HAVcr-1 FORW and HAVcr-1 FL-REV 
primers, 5 micromoles dNTP (Clontech Laboratories, Palo Alto CA) and 1 microliter of 
50xAdvantage-HF 2 polymerase (Clontech Laboratories, Palo Alto CA) in 50 microliter 
volume. The following reaction conditions were used: 

25 a) 96°C 3 minutes 

b) 96°C 30 seconds denaturation 

c) 70°C 30 seconds, primer annealing. This temperature was gradually 
decreased by l°C/cycle 

d) 72°C 2 minute extension. 
30 Repeat steps b-d 1 0 times 
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e) 96°C 30 seconds denaturation 

f) 60°C 30 seconds annealing 

g) 72°C 2 minute extension 
Repeat steps e-g 25 times 

5 h) 72°C 5 minutes final extension 

An approximately 1 kbp large amplified product was isolated from agarose gel and 
ligated to pCR2.1 vector (Invitrogen, Carlsbad, CA). The cloned insert was sequenced, 
using vector specific, Ml 3 Forward (-40) and Ml 3 Reverse primers and verified as an 
open reading frame coding for the mature form of CGS7008. The clone is called pCR2. 1- 
10 CG57008-03-S843J5B. 

Table 35. The nucleotide sequence of the insert in pCR2.1- CG57008-03-S843 15B 

TCTGTAAAGGTTGGTGGAGAG<X^ 

GCTGTCACATCAATGTGCTCGAATAGAGGCrCATGTTCTCTAT^ 

ATTGTCTGGACCAATGGAACCCACGTCACCTATCGGAAGGACACACGCTATAAGCT 
1 5 GGGGACCTTTCAAGAAGGGATGTCTCTTTGACCATAGAAAATACAGCTGTGTCTGACAGT 

GGCGTATATTGTTGCCGTGTTGAGCACCGTGGGTGGTTCAA 

TCATTGGAGATTGTGCCACCCAAGGTCACGACTACTCCAATTGTCAQIA 

GTC^CGACTGTTCGAACGAGCACCACTGTTCCAACGACAACGACTGTTCCA^ 

GTTCCAACAACAATGAGCATTCCAACGACAACGACTGTTCCGACGACA^ 
20 ACGACAACGAGCGTTCCAACGACAACGAGCATTCCAACAACAAC^ 

ACAACGGTCrCTACCTTTGTTCCTCC^TGCCTTTGCCCA 

GCCACTTCACCATCTTCACCTCAGCC^GCAGAAACCCACCCTACXjACA 

ATAAGGAGAGAACCCACCAGCTCACCATTGTACTCTTACACAACAGATGGGAATGACACC 

GTGACAGAGTCTTCAGATGGCCTTTGGAATAACAATCAAA 
25 AGTCTACTGACGGCCAATACCACrAAAGGAATCTATGCrGGAGTCrc 

GTGCTTCTTGCTCTTTTGGGTGTCATCATTGC 

CAACAACTAAGTGTTTCATTTAGCAGCCnTCAAATTAAAGCTTTC 

AAGGAAGTCCAAGCAGAAGACAATATCTACATTGAGAATAGTCTTTATGCCACGGAC (SEQ ID NO: 27) 

30 Table 36. The polypeptide sequence of the protein coded by the insert In pCR2.1- 
CG57008-03-S843 15B 

SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTO^ 
GDLS RRDVS LTI ENTAVSDSGVYCCRVEHRGWFNDMKI TVSLE I VP PKVTTTP I VTTVPT 
VTTVRTSTTVPTTTTVPTTTVPTTMS I PTTTTVPTTMTVSTTTSVPTTTS I PTTTSVPVT 
3 5 TTVSTFVPPMPLP RQNHE PVATS PS S PQPAETHPTTLQGA I RREPTS S PL YS YTTDGNDT 
VTES S DGL WNNNQTQL FL EHS LLTANTTKG I YAGVCI S VLVLLALLGV I IAKKYFPKKEV 
QQLSVS FSSLQI KALQNAVEKEVQAEDNI YI ENSLYATD (SEQ ID NO: 28) 

Example 2. Cloning of the extracellular domain of CG57008 

40 Oligonucleotide primers were designed to PCR amplify a DNA segment, 

representing an ORF, coding for the mature form of the extracellular domain of CG57008 
(NOV1), between residues 21 and 286. The forward primer includes, a BamHI restriction 
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site while the reverse primer contains an in-frame, Xhol restriction site for further 
subcloning purposes. The sequences of the primers are the following: 
HAVcr-1 FORW: 

GGATCCTCTGTAAAGGTTGGTGGAGAGGCAGGTCC (SEQ IDNO:29), 
5 HAVcr-1 SolubleREV: 

CTCGAGCAGTAGACTATGTTCTAGGAACAGTTGAG (SEQ ID NO:30). 

PCR reactions were set up using a total of 5 ng cDNA template containing equal 
parts of cDNA samples derived from human testis, human mammary, human skeletal 
muscle, and fetal brain; 1 pM of each of the HAVcr-1 FORW and HAVcr-1 SolubleREV 
10 primers, 5 micromoles dNTP (Clontech Laboratories, Palo Alto CA) and 1 microliter of 
50xAdvantage-HF 2 polymerase (Clontech Laboratories, Palo Alto CA) in 50 microliter 
volume. The following reaction conditions were used: 

a) 96°C 3 minutes 

b) 96°C 30 seconds denaturation 

15 c) 70°C 30 seconds, primer annealing. This temperature was gradually 

decreased by l°C/cycle 

d) 72°C 2 minutes extension. 
Repeat steps b-d 10 times 

e) 96°C 30 seconds denaturation 
20 f) 60°C 30 seconds annealing 

g) 72°C 2 minutes extension 
Repeat steps e-g 25 times 

h) 72°C 5 minutes final extension 

25 An approximately 750 bp large amplified product was isolated from agarose gel 

and ligated to pCR2.1 vector (Invitrogen, Carlsbad, CA). The cloned insert was 
sequenced, using vector specific, Ml 3 Forward(-40) and Ml 3 Reverse primers and 
verified as an open reading frame coding for the mature form of the extracellular domain 
of CG57008. The clone is called pCR2.1- CG57008-02-S841 J3A. 
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Table 3. The nucleotide sequence of the insert in pCR2.1- CG57008-02-S841 13A . 

TCTGTAAAGGTTGGTGGAGAGGCAGGTCCATCTGTCACACTACCCTGCCACTAC^ 
GCTGTCftCATQUVTGTGCTGGAATAG 
5 ATTGT CTGGACCAATGGAACC CACGT CAC CT AT CGGAAGGACACA CGCT AT AAGCT ATTG 
GGGGACCTTTCAAGAAGGGATGTCTCTTTGACCATAGAAAATA^ 
GGCXSTATATTGTTGCCGTGTTGAGCACCGTGGGTCGTTCAATG^ 
TCATTGGAGATTGTGCCACCCAAGGTCACGACTACTCCAATTGTCACAACTC 
GTCACGACTGTTCGAACGAGCACCACTGTTCCAACGACAAC^ 
1 0 GTTCCAACAACAATGAGCATTCCAACGACAACGACTGTTCCGACGAC^ 
ACGACAACGAGCXSTTCCAACGACAACGAGCATTCCAACAAC^ 
ACAACGGTCTCTACCTTTGTTCCTCCAATGCCTTTGCCCAG^ 
GCCACTTCACCATCTTCACCTCAGCCAGCAGAAACCC^CCCT^ 

ATAAGGAGAGAACCCACCAGCTCACCATTGTACTCTTACACAACAGATGGGAATGACACC 
1 5 GTGACAGAGTCTTCAGATGGCCITTGGAATAACAATCAAACT 
AGTCTACTG (SEQ ID NO: 31) 



Table 4. The polypeptide sequence of the protein coded bv the insert in pCR2.1- 
20 CG57008-02-S841 13A 

SVKVGGEAGPSVTLPCHY SGAVTSMCWNRGSCSLFTCQNG I VWTNGTHVTYRKDTR YKLL 
GDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIW 
VTTVRTSTTVPTTTTVPTTTVPTTMS I PTTTTVPTTMTVSTTTSVPTTTS I PTTTS VPVT 
TWSTFVPPMPLPRQNHEPVATSPSSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDT 
25 VTESSDGLWNNNQTQLPLEHSLL {SEQ ID NO: 32) 

Example 3- Expression of CG57008-02 in Insect Cells J 

A 0.79 kb BamHI-XhoI fragment containing the CG57008-02 (NOVlc) sequence 
30 was subcloned to the baculovirus expression vector, pMelV5His (CuraGen Corporation) to 
generate plasmid 646. Expression studies were performed using the pBlueBac baculovirus 
expression system (Invitrogen Corporation) following the manufacturer's 
recommendation. The conditioned media was analyzed 48 hours after post-infection by 
Western blot using an anti-V5 antibody (Invitrogen). 

35 

Example 4. Expression of CG57008-02 in human embryonic kidney 293 cells 

A 0.79 kb BamHI-XhoI fragment containing the CG57008-02 (NOVlc) sequence 
was subcloned into BamHI-XhoI digested pCEP4/Sec to generate plasmid 754. The 
resulting plasmid 754 was transfected into 293 cells using the LipofectaminePlus reagent 
40 following the manufacturer's instructions (Gibco/BRL). The cell pellet and supernatant 
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were harvested 72h post transfection and examined for CG57008-02 expression by 
Western blot (reducing conditions) using an anti-V5 antibody. 

Example 5. Expression of CGS7008-02 in human embryonic kidney 293 cells 
5 The insert from pCR2. 1 - CG57008-02-S84 1_1 3 A (see Example 2) was subcloned 

to the pEE14.4Sec mammalian expression vector. The vector carries the glutamine 
synthase selective marker that allows the selection of stable clones in the presence of 
methionine sulfoximine. Following selection, stable clones were established. The 
CG57008-02 expression and secretion, in two independent clones, was demonstrated by 
10 Western blot analysis. 

Example 6. Identification of Single Nucleotide Polymorphisms in NOVX nucleic acid 
sequences 

Variant sequences are also included in this application. A variant sequence can 

15 include a single nucleotide polymorphism (SNP). A SNP can, in some instances, be 
referred to as a "cSNP" to denote that the nucleotide sequence containing the SNP 
originates as a cDNA. A SNP can arise in several ways. For example, a SNP can be due 
to a substitution of one nucleotide for another at the polymorphic site. Such a substitution 
can be either a transition or a transversion. A SNP can also arise from a deletion of a 

20 nucleotide or an insertion of a nucleotide, relative to a reference allele. In this case, the 
polymorphic site is a site at which one allele bears a gap with respect to a particular 
nucleotide in another allele. SNPs occurring within genes can result in an alteration of the 
amino acid encoded by the gene at the position of the SNP. Intragenic SNPs can also be 
silent, when a codon including a SNP encodes the same amino acid as a result of the 

25 redundancy of the genetic code. SNPs occurring outside the region of a gene, or in an 
intron within a gene, do not result in changes in any amino acid sequence of a protein but 
can result in altered regulation of the expression pattern. Examples include alteration in 
temporal expression, physiological response regulation, cell type expression regulation, 
intensity of expression, and stability of transcribed message. 

30 SeqCalling assemblies produced by the exon linking process were selected and 

extended using the following criteria. Genomic clones having regions with 98% identity 
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to all or part of the initial or extended sequence were identified by BLASTN searches 
using the relevant sequence to query human genomic databases. The genomic clones that 
resulted were selected for further analysis because this identity indicates that these clones 
contain the genomic locus for these SeqCalling assemblies. These sequences were 
analyzed for putative coding regions as well as for similarity to the known DNA and 
protein sequences. Programs used for these analyses include Grail, Genscan, BLAST, 
HMMER, FASTA, Hybrid and other relevant programs. 

Some additional genomic regions may have also been identified because selected 
SeqCalling assemblies map to those regions. Such SeqCalling sequences may have 
overlapped with regions defined by homology or exon prediction. They may also be 
included because the location of the fragment was in the vicinity of genomic regions 
identified by similarity or exon prediction that had been included in the original predicted 
sequence. The sequence so identified was manually assembled and then may have been 
extended using one or more additional sequences taken from CuraGen Corporation's 
human SeqCalling database. SeqCalling fragments suitable for inclusion were identified 
by the CuraTools™ program SeqExtend or by identifying SeqCalling fragments mapping 
to the appropriate regions of the genomic clones analyzed. 

The regions defined by the procedures described above were then manually 
integrated and corrected for apparent inconsistencies that may have arisen, for example, 
from miscalled bases in the original fragments or from discrepancies between predicted 
exon junctions, EST locations and regions of sequence similarity, to derive the final 
sequence disclosed herein. When necessary, the process to identify and analyze 
SeqCalling assemblies and genomic clones was reiterated to derive the full length 
sequence (Alderborn et aL, Determination of Single Nucleotide Polymorphisms by 
Real-time Pyrophosphate DNA Sequencing. Genome Research. 10 (8) 1249-1265, 2000). 

Variants are reported individually but any combination of all or a select subset of 
variants are also included as contemplated NOVX embodiments of the invention. 

SNP data for CG57008-O2: 

Four polymorphic variants of CG57008-02 have been identified and are shown 

below. 
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Table B: 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13382206 


203 


C 


T 


51 


Ser 


Leu 


13382207 


338 


C 


T 


96 


Ala 


Val 


13378266 


572 


c 


T 


174 


Pro 


Leu 


13378265 


655 


A 


G 


202 


Hit 


Ala 



5 Example 7. Quantitative expression analysis of CG57008-02 in various cells and 
tissues 

The quantitative expression of various clones was assessed using microtiter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines 
and tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an 

10 Applied Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence 
Detection System. Various collections of samples are assembled on the plates, and 
referred to as Panel 1 (containing normal tissues and cancer cell lines), Panel 2 (containing 
samples derived from tissues from normal and cancer sources), Panel 3 (containing cancer 
cell lines), Panel 4 (containing cells and cell lines from normal tissues and cells related to 

IS inflammatory conditions), Panel 5D/5I (containing human tissues and cell lines with an 
emphasis on metabolic diseases), AI_comprehensive_panel (containing normal tissue and 
samples from autoinflammatory diseases), Panel CNSD.01 (containing samples from 
normal and diseased brains) and CNS__neurodegeneration_panel (containing samples from 
normal and Alzheimer's diseased brains). 

20 RNA integrity from all samples is controlled for quality by visual assessment of 

agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio 
as a guide (2:1 to 2.5:1 28s: 18s) and the absence of low molecular weight RNAs that 
would be indicative of degradation products. Samples are controlled against genomic 
DNA contamination by RTQ PCR reactions run in the absence of reverse transcriptase 

25 using probe and primer sets designed to amplify across the span of a single exon. 
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First, the RNA samples were normalized to reference nucleic acids such as 
constitutively expressed genes (for example, P-actin and GAPDH). Normalized RNA (5 
ul) was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master 
Mix Reagents (Applied Biosystems; Catalog No. 4309169) and gene-specific primers 

5 according to the manufacturer's instructions. 

In other cases, non-normalized RNA samples were converted to single strand 
cDNA (sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147) 
and random hexamers according to the manufacturer's instructions. Reactions containing 
up to 10 \ig of total RNA were performed in a volume of 20 \il and incubated for 60 

10 minutes at 42°C This reaction can be scaled up to 50 \ig of total RNA in a final volume of 
100 sscDNA samples are then normalized to reference nucleic acids as described 
previously, using IX TaqMan® Universal Master mix (Applied Biosystems; catalog No. 
4324020), following the manufacturer's instructions. 

Probes and primers were designed for each assay according to Applied Biosystems 

1 5 Primer Express Software package (version I for Apple Computer's Macintosh Power PC) 
or a similar algorithm using the target sequence as input. Default settings were used for 
reaction conditions and the following parameters were set before selecting primers: primer 
concentration = 250 nM, primer melting temperature (Tm) range = 58°-60°C, primer 
optimal Tm - 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe 

20 Tm must be 1 0°C greater than primer Tm, amplicon size 75bp to lOObp. The probes and 
primers selected (see below) were synthesized by Synthegen (Houston, TX, USA). Probes 
were double purified by HPLC to remove uncoupled dye and evaluated by mass 
spectroscopy to verify coupling of reporter and quencher dyes to the 5* and 3* ends of the 
probe, respectively. Their final concentrations were: forward and reverse primers, 900nM 

25 each, and probe, 200nM. 

PCR conditions: When working with RNA samples, normalized RNA from each 
tissue and each cell line was spotted in each well of either a 96 well or a 3 84- well PCR 
plate (Applied Biosystems). PCR cocktails included either a single gene specific probe and 
primers set, or two multiplexed probe and primers sets (a set specific for the target clone 

30 and another gene-specific set multiplexed with the target probe). PCR reactions were set 
up using TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 
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4313803) following manufacturer's instructions. Reverse transcription was performed at 
48°C for 30 minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 
40 cycles of 95°C for 1 5 seconds, 60°C for 1 minute. Results were recorded as CT values 
(cycle at which a given sample crosses a threshold level of fluorescence) using a log scale, 
5 with the difference in RNA concentration between a given sample and the sample with the 
lowest CT value being represented as 2 to the power of delta CT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and 
multiplying by 100. 

When working with sscDNA samples, normalized sscDNA was used as described 
10 previously for RNA samples. PCR reactions containing one or two sets of probe and 
primers were set up as described previously, using IX TaqMan® Universal Master mix 
(Applied Biosystems; catalog No. 4324020), following the manufacturer's instructions. 
PCR amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15 
seconds, 60°C for 1 minute. Results were analyzed and processed as described previously. 

15 Panels 1, LI, 1-2, and 1 3D 

The plates for Panels 1, 1.1, 1.2 and 1. 3D include 2 control wells (genomic DNA 
control and chemistry control) and 94 wells containing cDNA from various samples. The 
samples in these panels are broken into 2 classes: samples derived from cultured cell lines 
and samples derived from primary normal tissues. The cell lines are derived from cancers 

20 of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 

cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
gastric cancer and pancreatic cancer. Cell lines used in these panels are widely available 
through the American Type Culture Collection (ATCC), a repository for cultured cell 
lines, and were cultured using the conditions recommended by the ATCC. The normal 

25 tissues found on these panels are comprised of samples derived from all major organ 
systems from single adult individuals or fetuses. These samples are derived from the 
following organs: adult skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult 
kidney, fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various regions of the 
brain, the spleen, bone marrow, lymph node, pancreas, salivary gland, pituitary gland, 

30 adrenal gland, spinal cord, thymus, stomach, small intestine, colon, bladder, trachea, 
breast, ovary, uterus, placenta, prostate, testis and adipose. 
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In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used: 



ca. = carcinoma, 



5 



* = established from metastasis, 
met = metastasis, 
s cell var = small cell variant, 
non-s = non-sm = non-small, 



squam = squamous, 



10 



pi. eff = pi effusion = pleural effusion, 

glio = glioma, 

astro = astrocytoma, and 



neuro = neuroblastoma. 

General screening, panel v 1.4, vl.5, vl.6 and 1.7 

The plates for Panels 1 .4, 1 .5, 1 .6 and 1 .7 include 2 control wells (genomic DNA 
control and chemistry control) and 88 to 94 wells containing cDNA from various samples. 

1 5 The samples in Panels 1 .4, 1 .5, 1 .6 and 1 .7 are broken into 2 classes: samples derived from 
cultured cell lines and samples derived from primary normal tissues. The cell lines are 
derived from cancers of the following types: lung cancer, breast cancer, melanoma, colon 
cancer, prostate cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver 
cancer, renal cancer, gastric cancer and pancreatic cancer. Cell lines used in Panels 1 .4, 

20 1 .5, 1 .6 and 1 .7 are widely available through the American Type Culture Collection 
(ATCC), a repository for cultured cell lines, and were cultured using the conditions 
recommended by the ATCC. The normal tissues found on Panels 1.4, 1.5, 1.6 and 1.7 are 
comprised of pools of samples derived from all major organ systems from 2 to 5 different 
adult individuals or fetuses. These samples are derived from the following organs: adult 

25 skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, 
adult liver, fetal liver, adult lung, fetal lung, various regions of the brain, the spleen, bone 
marrow, lymph node, pancreas, salivary gland, pituitary gland, adrenal gland, spinal cord, 
thymus, stomach, small intestine, colon, bladder, trachea, breast, ovary, uterus, placenta, 
prostate, testis and adipose. Abbreviations are as described for Panels 1, 1. 1, 1.2, and 



30 



1.3D. 



Panels 2D, 2.2, 23 and 2.4 
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The plates for Panels 2D, 2.2, 2.3 and 2.4 generally include 2 control wells and 94 
test samples composed of RNA or cDNA isolated from human tissue procured by 
surgeons working in close cooperation with the National Cancer Institute's Cooperative 
Human Tissue Network (CHTN) or the National Disease Research Initiative (NDRI) or 
5 from Ardais or Clinomics). The tissues are derived from human malignancies and in cases 
where indicated many malignant tissues have "matched margins" obtained from 
noncancerous tissue just adjacent to the tumor. These are termed normal adjacent tissues 
and are denoted "NAT" in the results below. The tumor tissue and the "matched margins" 
are evaluated by two independent pathologists (the surgical pathologists and again by a 

10 pathologist at NDRI/ CHTN/Ardais/Clinomics). Unmatched RNA samples from tissues 
without malignancy (normal tissues) were also obtained from Ardais or Clinomics. This 
analysis provides a gross histopathological assessment of tumor differentiation grade. 
Moreover, most samples include the original surgical pathology report that provides 
information regarding the clinical stage of the patient These matched margins are taken 

15 from the tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated 
"NAT", for normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples 
were obtained from various human tissues derived from autopsies performed on elderly 
people or sudden death victims (accidents, etc.). These tissues were ascertained to be free 
of disease and were purchased from various commercial sources such as Clontech (Palo 

20 Alto, CA), Research Genetics, and Invitrogen. 
HASS Panel v 1.0 

The HASS panel v 1.0 plates are comprised of 93 cDNA samples and two controls. 
Specifically, 81 of these samples are derived from cultured human cancer cell lines that 
had been subjected to serum starvation, acidosis and anoxia for different time periods as 

25 well as controls for these treatments, 3 samples of human primary cells, 9 samples of 
malignant brain cancer (4 medulloblastomas and 5 glioblastomas) and 2 controls. The 
human cancer cell lines are obtained from ATCC (American Type Culture Collection) and 
fall into the following tissue groups: breast cancer, prostate cancer, bladder carcinomas, 
pancreatic cancers and CNS cancer cell lines. These cancer cells are all cultured under 

30 standard recommended conditions. The treatments used (serum starvation, acidosis and 
anoxia) have been previously published in the scientific literature. The primary human 
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cells were obtained from Clonetics (Walkersville, MD) and were grown in the media and 
conditions recommended by Clonetics. The malignant brain cancer samples are obtained 
as part of a collaboration (Henry Ford Cancer Center) and are evaluated by a pathologist 
prior to CuraGen receiving the samples . RNA was prepared from these samples using the 
5 standard procedures. The genomic and chemistry control wells have been described 
previously. 

ARDAIS Panel v 1.0 

The plates for ARDAIS panel v 1 .0 generally include 2 control wells and 22 test 
samples composed of RNA isolated from human tissue procured by surgeons working in 

10 close cooperation with Aidais Corporation. The tissues are derived from human lung 

malignancies (lung adenocarcinoma or lung squamous cell carcinoma) and in cases where 
indicated many malignant samples have "matched margins" obtained from noncancerous 
lung tissue just adjacent to the tumor. These matched margins are taken from the tissue 
surrounding (i.e. immediately proximal) to the zone of surgery (designated "NAT", for 

1 5 normal adjacent tissue) in the results below. The tumor tissue and the "matched margins" 
are evaluated by independent pathologists (the surgical pathologists and again by a 
pathologist at Ardais). Unmatched malignant and non-malignant RNA samples from lungs 
were also obtained from Ardais. Additional information from Ardais provides a gross 
histopathological assessment of tumor differentiation grade and stage. Moreover, most 

20 samples include the original surgical pathology report that provides information regarding 
the clinical state of the patient. 

ARDAIS Prostate/Kidney/Lung v 1.0 

The plates for ARDAIS prostate, kidney, and lung 1.0 respectively, generally 
include 2 control wells and 68 test samples composed of RNA isolated from human tissue 

25 procured by surgeons working in close cooperation with Ardais Corporation. The tissues 
are derived from human prostate, kidney, or lung malignancies and in cases where 
indicated malignant samples have "matched margins" obtained from noncancerous 
prostate, kidney, or lung tissue just adjacent to the tumor. These matched margins are 
taken from the tissue surrounding (i.e. immediately proximal) to the zone of surgery 

30 (designated "NAP\ for normal adjacent tissue) in the results below. The tumor tissue and 
the "matched margins" are evaluated by independent pathologists (the surgical 
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pathologists and again by a pathologist at Ardais). RNA from unmatched malignant and 
non-malignant prostate, kidney, or lung samples were also obtained from Ardais. 
Additional information from Ardais provides a gross histopathological assessment of 
tumor differentiation grade and stage. Moreover, most samples include the original 
5 surgical pathology report that provides information regarding the clinical state of the 
patient. 

Panel 3D, 3.1 and 3.2 

The plates of Panel 3D, 3.1, and 3.2 are comprised of 94 cDNA samples and two 
control samples. Specifically, 92 of these samples are derived from cultured human cancer 

10 cell lines, 2 samples of human primary cerebellar tissue and 2 controls. The human cell 
lines are generally obtained from ATCC (American Type Culture Collection), NCI or the 
German tumor cell bank and fall into the following tissue groups: Squamous cell 
carcinoma of the tongue, breast cancer, prostate cancer, melanoma, epidermoid carcinoma, 
sarcomas, bladder carcinomas, pancreatic cancers, kidney cancers, leukemias/lymphomas, 

15 ovarian/uterine/cervical, gastric, colon, lung and CNS cancer cell lines. In addition, there 
are two independent samples of cerebellum. These cells are all cultured under standard 
recommended conditions and RNA extracted using the standard procedures. The cell lines 
in panel 3D, 3.1, 3.2, 1, 1.1., 12, 1.3D, 1.4, 1.5, and 1.6 are of the most common cell lines 
used in the scientific literature. 

20 Panels 4D, 4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) 
composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human 
cell lines or tissues related to inflammatory conditions. Total RNA from control normal 
tissues such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney 

25 (Clontech) was employed. Total RNA from liver tissue from cirrhosis patients and kidney 
from lupus patients was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). 
Intestinal tissue for RNA preparation from patients diagnosed as having Crohn's disease 
and ulcerative colitis was obtained from the National Disease Research Interchange 
(NDRI) (Philadelphia, PA). 

30 Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle 

cells, small airway epithelium, bronchial epithelium, microvascular dermal endothelial 
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cells, microvascular lung endothelial cells, human pulmonary aortic endothelial cells, 
human umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, 
MD) and grown in the media supplied for these cell types by Clonetics. These primary cell 
types were activated with various cytokines or combinations of cytokines for 6 and/or 12- 
5 14 hours, as indicated. The following cytokines were used; IL-1 beta at approximately 1- 
5ng/ml, TNF alpha at approximately 5-10ng/ml, IFN gamma at approximately 20- 
50ng/ml, IL-4 at approximately 5-10ng/ml, IL-9 at approximately 5-10ng/ml, IL-13 at 
approximately 5-10ng/ml. Endothelial cells were sometimes starved for various times by 
culture in the basal media from Clonetics with 0. 1% serum. 

10 Mononuclear cells were prepared from blood of employees at CuraGen 

Corporation, using Ficoll. LAK cells were prepared from these cells by culture in DMEM 
5% FCS (Hyclone), 100nM non essential amino acids (Gibco/Life Technologies, 
Rockville, MD), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0* 5 M (Gibco), and 
lOmM Hepes (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 

15 10-20ng/ml PMA and l-2jig/ml ionomycin, IL-12 at 5-10ng/ml, IFN gamma at 20- 
50ng/ml and IL-1 8 at 5-10ng/ml for 6 hours. In some cases, mononuclear cells were 
cultured for 4-5 days in DMEM 5% FCS (Hyclone), lOOpM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xl0" 5 M (Gibco), and lOmM 
Hepes (Gibco) with PHA (phytohemagglutinin) or PWM (pokeweed mitogen) at 

20 approximately 5fig/mL Samples were taken at 24, 48 and 72 hours for RNA preparation. 
MLR (mixed lymphocyte reaction) samples were obtained by taking blood from two 
donors, isolating the mononuclear cells using Ficoll and mixing die isolated mononuclear 
cells 1:1 at a final concentration of approximately 2xl0 6 cells/ml in DMEM 5% FCS 
(Hyclone), 100^M non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 

25 mercaptoethanol (5.5xl0" 5 M) (Gibco), and lOmM Hepes (Gibco). The MLR was cultured 
and samples taken at various time points ranging from 1-7 days for RNA preparation. 

Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve 
VS selection columns and a Vario Magnet according to the manufacturer's instructions. 
Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf 

30 serum (FCS) (Hyclone, Logan, UT), IOOjiM non essential amino acids (Gibco), ImM 

sodium pyruvate (Gibco), mercaptoethanol 5.5xl0* 5 M (Gibco), and lOmM Hepes (Gibco), 
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50ng/ml GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of 
monocytes for 5-7 days in DMEM 5% FCS (Hyclone), IOOjiM non essential amino acids 
(Gibco), lmM sodium pyruvate (Gibco), mercaptoethanol 5.5xl0" 5 M (Gibco), lOmM 
Hepes (Gibco) and 10% AB Human Serum or MCSF at approximately 50ng/ml. 

5 Monocytes, macrophages and dendritic cells were stimulated for 6 and 12-14 hours with 
lipopolysaccharide (LPS) at lOOng/ml. Dendritic cells were also stimulated with anti- 
CD40 monoclonal antibody (Pharmingen) at 10ng/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection 

10 columns and a Vario Magnet according to the manufacturer's instructions. CD45RA and 
CD45RO CD4 lymphocytes were isolated by depleting mononuclear cells of CD8, CD56, 
CD14 and CD19 cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive 
selection. CD45RO beads were then used to isolate the CD45RO CD4 lymphocytes with 
the remaining cells being CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and 

1 5 CD8 lymphocytes were placed in DMEM 5% FCS (Hyclone), 1 00[iM non essential amino 
acids (Gibco), lmM sodium pyruvate (Gibco), mercaptoethanol 5.5xl0~ 5 M (Gibco), and 
lOmM Hepes (Gibco) and plated at 10 6 cells/ml onto Falcon 6 well tissue culture plates 
that had been coated overnight with 0.5ng/ml anti-CD28 (Pharmingen) and 3ug/ml anti- 
CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells were harvested for RNA 

20 preparation. To prepare chronically activated CD8 lymphocytes, we activated the isolated 
CD8 lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and then harvested 
the cells and expanded them in DMEM 5% FCS (Hyclone), IOOjiM non essential amino 
acids (Gibco), lmM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and 
lOmM Hepes (Gibco) and IL-2. The expanded CD8 cells were then activated again with 

25 plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was 
isolated 6 and 24 hours after the second activation and after 4 days of the second 
expansion culture. The isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 
lOOjiM non essential amino acids (Gibco), lmM sodium pyruvate (Gibco), 
mercaptoethanol 5.5xl0" 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2 for 4-6 days 

30 before RNA was prepared. 
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To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with 
sterile dissecting scissors and then passed through a sieve. Tonsil cells were then spun 
down and resupended at 10 6 ce!ls/ml in DMEM 5% FCS (Hyclone), 100nM non essential 
amino acids (Gibco), lmM sodium pyruvate (Gibco), mercaptoethanol 5.5xl0" 5 M (Gibco), 

5 and lOmM Hepes (Gibco). To activate the cells, we used PWM at 5\xg/ml or anti-CD40 
(Pharmingen) at approximately lO^g/ml and IL-4 at 5-10ng/ml. Cells were harvested for 
RNA preparation at 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon 
plates were coated overnight with lOjig/ml anti-CD28 (Pharmingen) and 2jig/ml OKT3 

10 (ATCC), and then washed twice with PBS. Umbilical cord blood CD4 lymphocytes 
(Poietic Systems, German Town, MD) were cultured at 10 5 -10 6 cells/ml in DMEM 5% 
FCS (Hyclone), 100nM non essential amino acids (Gibco), lmM sodium pyruvate 
(Gibco), mercaptoethanol 5.5xl0' 5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (4ng/ml). 
IL-12 (5ng/ml) and anti-IL4 (lug/ml) were used to direct to Thl, while IL-4 (5ng/ml) and 

15 anti-IFN gamma (lug/ml) were used to direct to Th2 and IL-10 at 5ng/ml was used to 
direct to Trl. After 4-5 days, the activated Thl, Th2 and Trl lymphocytes were washed 
once in DMEM and expanded for 4-7 days in DMEM 5% FCS (Hyclone), lOO^M non 
essential amino acids (Gibco), lmM sodium pyruvate (Gibco), mercaptoethanol 5.5x10" 
5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (lng/ml). Following this, the activated Thl, 

20 Th2 and Trl lymphocytes were re-stimulated for 5 days with anti-CD28/OKT3 and 
cytokines as described above, but with the addition of anti-CD95L (lug/ml) to prevent 
apoptosis. After 4-5 days, the Thl, Th2 and Trl lymphocytes were washed and then 
expanded again with IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes were 
maintained in this way for a maximum of three cycles. RNA was prepared from primary 

25 and secondary Thl , Th2 and Trl after 6 and 24 hours following the second and third 
activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second 
and third expansion cultures in Interleukin 2. 

The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, 
KU-812. EOL cells were further differentiated by culture in O.lmM dbcAMP at 

30 5xl0 5 cells/ml for 8 days, changing the media every 3 days and adjusting the cell 

concentration to 5xl0 5 cells/ml. For the culture of these cells, we used DMEM or RPMI (as 
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recommended by the ATCC), with the addition of 5% FCS (Hyclone), 100nM non 
essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10' 
5 M (Gibco), lOmM Hepes (Gibco). RNA was either prepared from resting cells or cells 
activated with PMA at lOng/ml and ionomycin at 1 ng/ml for 6 and 14 hours. Keratinocyte 

5 line CCD 106 and an airway epithelial tumor line NCI-H292 were also obtained from the 
ATCC. Both were cultured in DMEM 5% FCS (Hyclone), lOO^M non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xl0~ 5 M (Gibco), and 
lOmM Hepes (Gibco). CCD1 106 cells were activated for 6 and 14 hours with 
approximately 5 ng/ml TNF alpha and lng/ml IL-1 beta, while NCI-H292 cells were 

10 activated for 6 and 14 hours with the following cytokines: 5ng/ml IL-4, 5ng/ml IL-9, 
5ng/ml IL-1 3 and 25ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 
10 7 cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane 
(Molecular Research Corporation) was added to the RNA sample, vortexed and after 10 

15 minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. 
The aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of 
isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down 
at 9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet 
was redissolved in 300nl of RNAse-free water and 35(il buffer (Promega) S\il DTT, 7\il 

20 RNAsin and 8jxl DNAse were added. The tube was incubated at 37°C for 30 minutes to 
remove contaminating genomic DNA, extracted once with phenol chloroform and re- 
precipitated with 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol. The 
RNA was spun down and placed in RNAse free water. RNA was stored at -80°C. 

Alcomprehensive panel_vl.O 

25 The plates for AI_comprehensive panel_vl .0 include two control wells and 89 test 

samples comprised of cDNA isolated from surgical and postmortem human tissues 
obtained from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was 
extracted from tissue samples from the Backus Hospital in the Facility at CuraGen. Total 
RNA from other tissues was obtained from Clinomics. 

30 Joint tissues including synovial fluid, synovium, bone and cartilage were obtained 

from patients undergoing total knee or hip replacement surgery at the Backus Hospital. 
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Tissue samples were immediately snap frozen in liquid nitrogen to ensure that isolated 
RNA was of optimal quality and not degraded. Additional samples of osteoarthritis and 
rheumatoid arthritis joint tissues were obtained from Clinomics. Normal control tissues 
were supplied by Clinomics and were obtained during autopsy of trauma victims. 
5 Surgical specimens of psoriatic tissues and adjacent matched tissues were provided 

as total RNA by Clinomics. Two male and two female patients were selected between the 
ages of 25 and 47. None of the patients were taking prescription drugs at the time samples 
were isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and 
10 Crohns disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue 
from three female and three male Crohn's patients between the ages of 41-69 were used. 
Two patients were not on prescription medication while the others were taking 
dexamethasone, phenobarbital, or tylenol. Ulcerative colitis tissue was from three male 
and four female patients. Four of the patients were taking lebvid and two were on 
15 phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims with no disease or 
with emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients 
ranged in age from 40-70 and all were smokers, this age range was chosen to focus on 
patients with cigarette-linked emphysema and to avoid those patients with alpha- lanti- 

20 trypsin deficiencies. Asthma patients ranged in age from 36-75, and excluded smokers to 
prevent those patients that could also have COPD. COPD patients ranged in age from 35- 
80 and included both smokers and non-smokers. Most patients were taking corticosteroids, 
and bronchodilators. 

In the labels employed to identify tissues in the AI_comprehensive panel_vl.O 

25 panel, the following abbreviations are used: 

AI = Autoimmunity 
Syn = Synovial 

Normal = No apparent disease 
Rep22 /Rep20 = individual patients 
30 RA = Rheumatoid arthritis 

Backus = From Backus Hospital 
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OA = Osteoarthritis 
(SS) (BA) (MF) = Individual patients 
Adj = Adjacent tissue 
Match control = adjacent tissues 
5 -M = Male 
-F = Female 

COPD = Chronic obstructive pulmonary disease 
AL05 chondrosarcoma 

The AI.05 chondrosarcoma plates are comprised of SW1353 cells that had been 
1 0 subjected to serum starvation and treatment with cytokines that are known to induce MMP 
(1, 3 and 13) synthesis (e.g. ILlbeta). These treatments include: IL-lbeta (10 ng/ml), IL- 
Ibeta + TNF-alpha (50 ng/ml), IL-lbeta + Oncostatin (50 ng/ml) and PMA (100 ng/ml). 
The SW1 353 cells were obtained from the ATCC (American Type Culture Collection) and 
were all cultured under standard recommended conditions. The SW1353 cells were plated 
15 at 3 xlO 5 cells/ml (in DMEM medium-10 % FBS) in 6-well plates. The treatment was 
done in triplicate, for 6 and 18 h. The supernatants were collected for analysis of MMP 1, 
3 and 13 production and for RNA extraction. RNA was prepared from these samples using 
the standard procedures. 
Panels 5D and 51 

20 The plates for Panel 5D and 51 include two control wells and a variety of cDN As 

isolated from human tissues and cell lines with an emphasis on metabolic diseases. 
Metabolic tissues were obtained from patients enrolled in the Gestational Diabetes study. 
Cells were obtained during different stages in the differentiation of adipocytes from 
human mesenchymal stem cells. Human pancreatic islets were also obtained. 

25 In the Gestational Diabetes study, subjects are young (1 8 - 40 years), otherwise 

healthy women with and without gestational diabetes undergoing routine (elective) 
Caesarcan section. After delivery of the infant, when the surgical incisions were being 
repaired/closed, the obstetrician removed a small sample (less than 1 cc) of the exposed 
metabolic tissues during the closure of each surgical level. The biopsy material was rinsed 

30 in sterile saline, blotted and fast frozen within 5 minutes from the time of removal. The 
tissue was then flash frozen in liquid nitrogen and stored, individually, in sterile screw-top 
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tubes and kept on dry ice for shipment to or to be picked up by CuraGen. The metabolic 
tissues of interest include uterine wall (smooth muscle), visceral adipose, skeletal muscle 
(rectus) and subcutaneous adipose. Patient descriptions are as follows: 

Patient 2: Diabetic Hispanic, overweight, not on insulin 

5 Patient 7-9: Nondiabetic Caucasian and obese (BMI>30) 
Patient 10: Diabetic Hispanic, overweight, on insulin 
Patient 1 1 : Nondiabetic African American and overweight 
Patient 12: Diabetic Hispanic on insulin 

Adipocyte differentiation was induced in donor progenitor cells obtained from 

10 Osirus (a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had 
only two replicates. Scientists at Clonetics isolated, grew and differentiated human 
mesenchymal stem cells (HuMSCs) for CuraGen based on the published protocol found in 
Mark F. Pittenger, et al., Multilineage Potential of Adult Human Mesenchymal Stem Cells 
Science Apr 2 1999: 143-147. Clonetics provided Trizol lysates or frozen pellets suitable 

1 5 for mRNA isolation and ds cDNA production. A general description of each donor is as 
follows: 

Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 
Donor 2 and 3 AM: Adipose, AdiposeMidway Differentiated 
Donor 2 and 3 AD: Adipose, Adipose Differentiated 
20 Human cell lines were generally obtained from ATCC (American Type Culture 

Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver 
HepG2 cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These 
cells are all cultured under standard recommended conditions and RNA extracted using 
25 the standard procedures. All samples were processed at CuraGen to produce single 
stranded cDNA. 

Panel 51 contains all samples previously described with the addition of pancreatic 
islets from a 58 year old female patient obtained from the Diabetes Research Institute at 
the University of Miami School of Medicine. Islet tissue was processed to total RNA at an 
30 outside source and delivered to CuraGen for addition to panel 51. 
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In the labels employed to identify tissues in the 5D and 51 panels, the following 
abbreviations are used: 

GO Adipose = Greater Omentum Adipose 
SK= Skeletal Muscle 
5 UT - Uterus 

PL = Placenta 

AD = Adipose Differentiated 

AM = Adipose Midway Differentiated 

U = Undifferentiated Stem Cells 

1 0 Human Metabolic RTQ-PCR Panel 

The plates for the Human Metabolic RTQ-PCR Panel include two control wells 
(genomic DNA control and chemistry control) and 211 cDNAs isolated from human 
tissues and cell lines with an emphasis on metabolic diseases. This panel is useful for 
establishing the tissue and cellular expression profiles for genes believed to play a role in 

15 the etiology and pathogenesis of obesity and/or diabetes and to confirm differential 
expression of such genes derived from other methods. Metabolic tissues were obtained 
from patients enrolled in the CuraGen Gestational Diabetes study and from autopsy tissues 
from Type II diabetics and age, sex and race-matched control patients. One or more of the 
following were used to characterize the patients: body mass index [BMI = wt (kg) / ht 

20 (m 2 )], semm glucose, HgbAlc. Cell lines used in this panel are widely available through 
the American Type Culture Collection (ATCC), a repository for cultured cell lines. RNA 
from human Pancreatic Islets was also obtained. 

In the Gestational Diabetes study, subjects are young (18-40 years), otherwise 
healthy women with and without gestational diabetes undergoing routine (elective) 

25 Caesarian section. After delivery of the infant, when the surgical incisions were being 
repaired/closed, the obstetrician removed a small sample (less than lcc) of the exposed 
metabolic tissues during the closure of each surgical level. The biopsy material was rinsed 
in sterile saline, blotted, and then flash frozen in liquid nitrogen and stored, individually, 
in sterile screw-top tubes and kept on dry ice for shipment to or to be picked up by 

30 CuraGen. The metabolic tissues of interest include uterine wall (smooth muscle), visceral 
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adipose, skeletal muscle (rectus), and subcutaneous adipose. Patient descriptions are as 
follows: 

Patient 7 - Non-diabetic Caucasian and obese 

Patient 8 - Non-diabetic Caucasian and obese 
5 Patient 12 - Diabetic Caucasian with unknown BMI and on insulin 

Patient 13 - Diabetic Caucasian, overweight, not on insulin 

Patient 15 - Diabetic Caucasian, obese, not on insulin 

Patient 17 - Diabetic Caucasian, normal weight, not on insulin 

Patient 18 - Diabetic Hispanic, obese, not on insulin 
10 Patient 19 - Non-diabetic Caucasian and normal weight 

Patient 20 - Diabetic Caucasian, overweight, and on insulin 

Patient 21 - Non-diabetic Caucasian and overweight 

Patient 22 - Diabetic Caucasian, normal weight, on insulin 

Patient 23 - Non-diabetic Caucasian and overweight 
1 5 Patient 25 - Diabetic Caucasian, normal weight, not on insulin 

Patient 26 - Diabetic Caucasian, obese, on insulin 

Patient 27 - Diabetic Caucasian, obese, on insulin 

Total RNA was isolated from metabolic tissues of 12 Type II diabetic patients and 
12 matched control patients included hypothalamus, liver, pancreas, small intestine, psoas 
20 muscle, diaphragm muscle, visceral adipose, and subcutaneous adipose. The diabetics and 
non-diabetics were matched for age, sex, ethnicity, and BMI where possible. 

The panel also contains pancreatic islets from a 22 year old male patient (with a 
BMI of 35) obtained from the Diabetes Research Institute at the University of Miami 
School of Medicine. Islet tissue was processed to total RNA at CuraGen. 
25 Cell lines used in this panel are widely available through the American Type 

Culture Collection (ATCC), a repository for cultured cell lines, and were cultured at an 
outside facility. The RNA was extracted at CuraGen according to CuraGen protocols. All 
samples were then processed at CuraGen to produce single stranded cDNA. 

In the labels used to identify tissues in the Human Metabolic panel, the following 
30 abbreviations are used: 



151 



WO 03/080856 



PCT/US03/08490 



PI = placenta 

Go = greater omentum 

Sk = skeletal muscle 

Ut = uterus 
5 CC = Caucasian 

HI = Hispanic 

AA = African American 

AS = Asian 

Diab = Type II diabetic 
1 0 Norm 0 Non-diabetic 

Overwt = Overweight; med BMI 

Obese = Hi BMI 

Low BM - 20-25 

Med BM = 26-30 
1 5 Hi BMI = Greater than 30 

M - Male 

# = Patient identifier 
Vis. = Visceral 
SubQ = Subcutaneous 

20 Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
comprised of cDNA isolated from postmortem human brain tissue obtained from the 
Harvard Brain Tissue Resource Center. Brains are removed from calvaria of donors 
between 4 and 24 hours after death, sectioned by neuroanatomists, and frozen at -80°C in 

25 liquid nitrogen vapor. All brains are sectioned and examined by neuropathologists to 
confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains two brains 
from each of the following diagnoses: Alzheimer's disease, Parkinson's disease, 
Huntington's disease, Progressive Supernuclear Palsy, Depression, and "Normal controls" 

30 Within each of these brains, the following regions are represented: cingulate gyrus, 

temporal pole, globus palladus, substantia nigra, Brodman Area 4 (primary motor strip), 
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Brodman Area 7 (parietal cortex), Brodman Area 9 (prefrontal cortex), and Brodman area 
17 (occipital cortex). Not all brain regions are represented in all cases; e.g., Huntington's 
disease is characterized in part by neurodegeneration in the globus palladus, thus this 
region is impossible to obtain from confirmed Huntington's cases. Likewise Parkinson's 
S disease is characterized by degeneration of the substantia nigra making this region more 
difficult to obtain. Normal control brains were examined for neuropathology and found to 
be free of any pathology consistent with neurodegeneration. 

In the labels employed to identify tissues in the CNS panel, the following 
abbreviations are used: 
1 0 PSP = Progressive supranuclear palsy 

Sub Nigra = Substantia nigra 

Glob Palladus= Globus palladus 

Temp Pole = Temporal pole 

Cing Gyr = Cingulate gyrus 
15 BA 4 = Brodman Area 4 

Panel CNS_Neurodegeneration_V1.0 

The plates for Panel CNS_Neurodegeneration_V1.0 include two control wells and 
47 test samples comprised of cDNA isolated from postmortem human brain tissue 
obtained from the Harvard Brain Tissue Resource Center (McLean Hospital) and the 
20 Human Brain and Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare 
System). Brains are removed from calvaria of donors between 4 and 24 hours after death, 
sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are 
sectioned and examined by neuropathologists to confirm diagnoses with clear associated 
neuropathology. 

25 Disease diagnoses are taken from patient records. The panel contains six brains 

from Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who 
showed no evidence of dementia prior to death. The eight normal control brains are 
divided into two categories: Controls with no dementia and no Alzheimer's like pathology 
(Controls) and controls with no dementia but evidence of severe Alzheimer's like 

30 pathology, (specifically senile plaque load rated as level 3 on a scale of 0-3; 0 = no 

evidence of plaques, 3 = severe AD senile plaque load). Within each of these brains, the 
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following regions are represented: hippocampus, temporal cortex (Brodman Area 21), 
parietal cortex (Brodman area 7), and occipital cortex (Brodman area 17). These regions 
were chosen to encompass all levels of neurodegeneration in AD. The hippocampus is a 
region of early and severe neuronal loss in AD; the temporal cortex is known to show 
5 neurodegeneration in AD after the hippocampus; the parietal cortex shows moderate 
neuronal death in the late stages of the disease; the occipital cortex is spared in AD and 
therefore acts as a "control" region within AD patients. Not all brain regions are 
represented in all cases. 

In the labels employed to identify tissues in the CNS^Neurodegeneration^Vl.O 
10 panel, the following abbreviations are used: 

AD = Alzheimer's disease brain; patient was demented and showed AD-like 

pathology upon autopsy 

Control = Control brains; patient not demented, showing no neuropathology 
Control (Path) = Control brains; pateint not demented but showing sever AD-like 
15 pathology 

SupTemporal Ctx = Superior Temporal Cortex 
Inf Temporal Ctx = Inferior Temporal Cortex 

Panel CNS_Neurodegeneration_V2.0 

20 The plates for Panel CNS_Neurodegeneration_V2.0 include two control wells and 

47 test samples comprised of cDNA isolated from postmortem human brain tissue 
obtained from the Harvard Brain Tissue Resource Center (McLean Hospital) and the 
Human Brain and Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare 
System). Brains are removed from calvaria of donors between 4 and 24 hours after death, 

25 sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are 
sectioned and examined by neuropathologists to confirm diagnoses with clear associated 
neuropathology. 

Disease diagnoses are taken from patient records. The panel contains sixteen brains 
from Alzheimer's disease (AD) patients, and twenty-nine brains from ''Normal controls" 
30 who showed no evidence of dementia prior to death. The twenty-nine normal control 
brains are divided into two categories: Fourteen controls with no dementia and no 

154 



WO 03/080856 



PCT/US03/08490 



Alzheimer's like pathology (Controls) and fifteen controls with no dementia but evidence 
of severe Alzheimer's like pathology, (specifically senile plaque load rated as level 3 on a 
scale of 0-3; 0 = no evidence of plaques, 3 = severe AD senile plaque load). Tissue from 
the temporal cotex (Broddmann Area 21) was selected for all samples from the Harvard 

5 Brain Tissue Resource Center, from the two sample from the Human Brain and Spinal 
Fluid Resource Center (samples 1 and 2) tissue from the inferior and superior temporal 
cortex was used; each sample on the panel represents a pool of inferior and superior 
temporal cortex from an individual patient. The temporal cortex was chosen as it shows a 
loss of neurons in the intermediate stages of the disease. Selection of a region which is 

1 0 affected in the early stages of Alzheimer's disease (e.g., hippocampus or entorhinal cortex) 
could potentially result in the examination of gene expression after vulnerable neurons are 
lost, and missing genes involved in the actual neurodegeneration process. 

In the labels employed to identify tissues in the CNS_Neurodegeneration_V2.0 
panel, the following abbreviations are used: 

15 AD = Alzheimer's disease brain; patient was demented and showed AD-like 

pathology upon autopsy 

Control = Control brains; patient not demented, showing no neuropathology 
AH3 = Control brains; patient not demented but showing severe AD-like pathology 
Inf & Sup Temp Ctx Pool = Pool of inferior and superior temporal cortex for a 
20 given individual 

A. CG57008-02: HAVcr-1 

Expression of gene CG57008-02 was assessed using the primer-probe set Ag821, 
described in Table 5. Results of the RTQ-PCR runs are shown in Tables 6-17 

25 
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Table 5. Probe Name Ag821 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
No 


Forward 


5 1 -tcctcaagtggtcatcttaagc-3 1 


22 


57 


33 


Probe 


TET-5 • -tcctacatctggcagattctgtagctg-3 ■ - 
TAMRA 


27 


83 


34 


Reverse 


5 ' -tctccaccaacctttacagaac-3 ' 


22 


110 


35 



Table 6. Alcomprehensive panel_vl.O 



Column A - Rel. Exp.(%) Ag821, Run 2S949835S 



Tissue Name 


A 


Tissue Name 


A 


110967COPD-F 


16.7 


1 12427 Match Control Psoriasis-F 


31.4 


1 10980 COPD-F 


7.2 


112418 Psoriasis-M 


6.6 


110968 COPD-M 


6.6 


1 1 2723 Match Control Psoriasis-M 


13.3 


110977 COPD-M 


0.0 


112419 Psoriasis-M 


10.7 


1 1 0989 Emphysema-F 


25.3 


1 12424 Match Control Psoriasis-M 


12.0 


110992Emphysema-F 


65.1 


112420 Psoriasis-M 


46.7 


1 10993 Emphysema-F 


10.9 


1 12425 Match Control Psoriasis-M 


38.2 


1 10994 Emphysema-F 


8.0 


104689 (MF) OA Bone-Backus 


12.4 


11 0995 Emphysema-F 


100.0 


104690 (MF) Adj "Normal" Bone-Backus 


12.7 


1 10996 Emphysema-F 


41.2 


104691 (MF) OA Synovium-Backus 


14.4 


1 10997 Asthma-M 


3.4 


104692 (BA) OA Cartilage-Backus 


5.0 


111001 Asthma-F 


11.1 


104694 (BA) OA Bone-Backus 


3.8 


111002Asthma-F 


5.4 


104695 (BA) Adj "Normal" Bone-Backus 


6.0 


1 1 1003 Atopic Asthma-F 


16.0 


104696 (BA) OA Synovium-Backus 


6.3 


1 1 1004 Atopic Asthma-F 


14.6 


104700 (SS) OA Bone-Backus 


6.5 


1 1 1005 Atopic Asthma-F 


9.6 


104701 (SS) Adj "Normal" Bone-Backus 


8.4 


1 1 1006 Atopic Asthma-F 


2.7 


104702 (SS) OA Synovium-Backus 


15.8 


111417Allergy-M 


10.8 


1 1 7093 OA Cartilage Rep7 


40.6 


1 12347 Allergy-M 


0.8 


112672 0ABone5 


35.4 


1 12349 Normal Lung-F 


1.5 


1 12673 OA Synovium5 


19.3 


1 12357 Normal Lung-F 


23.2 


1 12674 OA Synovial Fluid cellsS 


12.2 


112354 Normal Lung-M 


14.0 


1 17100 OA Cartilage Repl4 


1.5 


112374 Crohns-F 


12.8 


112756 OA Bone9 


6.3 


1 12389 Match Control Crohns-F 


3.6 


1 12757 OA Synovium9 


4.7 


112375 Crohns-F 


8.6 


1 12758 OA Synovial Fluid Cells9 


9.3 


1 12732 Match Control Crohns-F 


3.2 


1 17125 RA Cartilage Rep2 


9.5 
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1 1 J*t7/ oy IIU VILLI I U XS-TV 


5 8 


112386 Ulcer Col-F 


5.2 


1 1 3498 Syn Fluid Cells3 RA 


10.5 


1 12738 Match Control Ulcer Col-F 


1.1 


1 17106 Normal Cartilage Rep20 


0.9 


112381 Ulcer Col-M 


3.0 


113663 Bone3 Normal 


3.1 


1 12735 Match Control Ulcer Col-M 


9.9 


1 13664 Synovium3 Normal 


0.0 


112382 Ulcer Col-M 


8.7 


1 13665 Syn Fluid Cells3 Normal 


1.0 


1 12394 Match Control Ulcer Col-M 


2.8 


1 17107 Normal Cartilage Rep22 


8.3 


112383 Ulcer Col-M 


37.6 


113667 Bone4 Normal 


11.9 


1 12736 Match Control Ulcer Col-M 


2.2 


1 13668 Synovium4 Normal 


23.0 


112423 Psoriasis-F 


8.9 


1 13669 Syn Fluid Cells4 Normal 


20.9 



Table 7. Ardais Panel 1.1 

Column A - Rel. Exp.(%) Ag821, Run 325595050* 



Tissue Name 


A 


Tissue Name 


A 


Lung adenocarcinoma SI A 


3.7 


Lung SCC SI A 


6.3 


Lung adenocarcinoma SI B 


13.4 


Lung SCC SI B NAT 


7.5 


Lung adenocarcinoma SI B NAT 


9.5 


Lung SCC SI C 


0.8 


Lung adenocarcinoma SI C 


0.6 


Lung SCC SI C NAT 


13.3 


Lung adenocarcinoma SI C NAT 


6.0 


Lung SCC SI D 


9.9 


Lung adenocarcinoma SII A 


6.3 


Lung SCC SID NAT 


1.7 


Lung adenocarcinoma SII A NAT 


6.8 


Lung SCC SB A 


3.7 


Lung adenocarcinoma SII C NAT 


28.1 


Lung SCC SII B 


6.4 


Lung adenocarcinoma Sin A 


10.7 


Lung SCC SHI A 


1.8 


Lung adenocarcinoma SHI B 


2.5 


Lung SCC SniANAT 


3.0 


Lung adenocarcinoma SHI C 


100.0 
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Table 8. Ardais Panel v.1.0 



Column A - Rel. Exp.(%) Ag821, Run 263526496 


Tissue Name 


A 


Tissue Name 


A 


136799_Lung cancer(362) 


6.1 


136787_lung cancer(356) 


45.1 


136800_LungNAT(363) 


0.8 


136788_lungNAT(357) 


17.2 


136813_Lung cancei(372) 


19.3 


136804_Lung cancer(369) 


100.0 


136814_LungNAT(373) 


8.5 


136805_LungNAT(36A) 


11.1 


13681 5_Lung cancer(374) 


10.6 


136806_Lung cancer(36B) 


13.8 


136816_LungNAT(375) 


19.5 


136807_Lung NAT(36C) 


6.3 


136791_Lung cancer(35A) 


34.9 


136789_lung cancer(358) 


59.5 


136795_Lung cancer(35E) 


9.3 


136802_Lung cancer(365) 


9.0 ! 


136797_Lung cancer(360) 


11.1 


136803_Lung cancei(368) 


6.9 


136794_lungNAT(35D) 


26.1 


13681 l_Lung cancer(370) 


80.7 


136818_LungNAT(377) 


5.1 


136810_LungNAT(36F) 


2.4 



Table 9. CNS_neurodegeneration_vl.O 



Column A - Rel. Exp.(%) Ag821, Run 271695186 


Tissue Name 


A 


Tissue Name 


A 


AD 1 Hippo 


9.7 


Control (Path) 3 Temporal Ctx 


6.2 


AD 2 Hippo 


14.5 


Control (Path) 4 Temporal Ctx 


22.5 


AD 3 Hippo 


7.3 


AD 1 Occipital Ctx 


10.9 


AD 4 Hippo 


9.1 


AD 2 Occipital Ctx (Missing) 


0.0 


AD 5 Hippo 


29.1 


AD 3 Occipital Ctx 


6.7 


AD 6 Hippo 


97.3 


AD 4 Occipital Ctx 


9.1 


Control 2 Hippo 


32.3 


AD 5 Occipital Ctx 


9.6 


Control 4 Hippo 


19.2 


AD 6 Occipital Ctx 


20.6 


Control (Path) 3 Hippo 


6.4 


Control 1 Occipital Ctx 


3.4 


AD 1 Temporal Ctx 


25.2 


Control 2 Occipital Ctx 


36.6 


AD 2 Temporal Ctx 


37.9 


Control 3 Occipital Ctx 


9.7 


AD 3 Temporal Ctx 


4.1 


Control 4 Occipital Ctx 


6.8 


AD 4 Temporal Ctx 


15.0 


Control (Path) 1 Occipital Ctx 


80.7 


AD 5 Inf Temporal Ctx 


60.7 


Control (Path) 2 Occipital Ctx 


12.2 


AD 5 Sup Temporal Ctx 


28.5 


Control (Path) 3 Occipital Ctx 


5.4 


AD 6 Inf Temporal Ctx 


97.3 


Control (Path) 4 Occipital Ctx 


14.8 


AD 6 Sup Temporal Ctx 


100.0 


Control 1 Parietal Ctx 


2.5 


Control 1 Temporal Ctx 


| 4.9 


Control 2 Parietal Ctx 


25.0 
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Control 2 Temporal Ctx 


14.5 


Control 3 Parietal Ctx 


9.9 


Control 3 Temporal Ctx 


7.5 


Control (Path) 1 Parietal Ctx 


40.9 


Control 3 Temporal Ctx 


15.9 


Control (Path) 2 Parietal Ctx 


27.5 


Control (Path) 1 Temporal Ctx 


41.2 


Control (Path) 3 Parietal Ctx 


2.2 


Control (Path) 2 Temporal Ctx 


35.6 


Control (Path) 4 Parietal Ctx 


53.6 



Table 10. General screening, panel yl.6 



Column A - Rel. Exp.(%) Ag821, Run 278391652 



J. ISfafUv name 


A 


1 issue lialllC 


A 




0 0 

v.v 


Renal ca TK-1 0 
xvviuti tOi x xv l v 


51 4 


Mplannma* T-RfiKKfA 1 T 
IVlClallUIIla IlaUOO^Aj. 1 


0 0 

v.v 




0 2 


Melanoma* H<;688fR i T 


0 0 

V.V 


fiastric ca fliver met i NCI-N87 


0.0 


IVlClallUIIla 1VX 1 1 


0 0 

v.v 


Gastric ca KATO TTT 

xjaoitiv va • i\At v xxx 


0 1 

V. 1 


ivjLviaii villa j_<v-/y\J.ivi v i 


0 0 


Colon ca SW-948 

W1V1A V(li TV 7TO 


0.0 


Melanoma* ^K-MFT -5 


00 

V» V 


Colon ca SW480 


0.0 


oijucuiiuuo veil waiviiiviixa kjv^v~*t 


0 0 


Colon ca * CSW480 met i SW620 


0.0 


Tpctic Pool 


0 1 

V. 1 


Colon ca HT29 


0.4 


Prnctate ra * fhone met i PC-5 


00 


Colon ca HCT-116 


0.0 


Pmctatp PfMll 
x i vavaiv i vvi 


0 0 

v.v 


Colon ca CaCo-2 


97.3 


Placenta 


fi o 


L/Oion cancer ussue 


A ft 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK-OV-3 


4.1 


Colon ca.SW-48 


0.0 


Ovarian ca. OVCAR-4 


0.0 


Colon Pool 


0.1 


Ovarian ca. OVCAR-5 — — 1 


35.6 


Small Intestine Pool 


0.1 


Ovarian ca. IGROV-1 


6.5 


Stomach Pool 


0.1 


Ovarian ca. OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.1 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA-MB-231 


0.0 


Lymph Node Pool 


0.1 


Breast ca. BT 549 


0.1 


Fetal Skeletal Muscle 


0.1 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.1 


Breast Pool 


0.1 


Thymus Pool 


0.1 


Trachea 


0.0 


CNS cancer (glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer (glio/astro) U-l 18-MG 


0.0 


Fetal Lung 


0.1 


CNS cancer (neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF-539 


0.0 
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Lung ca. La-1 


n i 

U.l 


L^iNo cancer ^asiroj oind- / j 


n i 

U. 1 


Lung ca. iNCi-xii^o 


a a 
u.u 


L/iNo cancer \&iio) oiSD-iy 




Lung ca. orir- / / 


a a 


ciNo cancer i,g"°.J or-zyj 


a a 

U.U 


Lung ca. A549 


Zoo 


Brain (Amygdala) Pool 


U.l 


Lung ca. NCI-ri^zo 


A A 

u.u 


Brain (cerebellum) 


a o 
u.z 


Lung ca. NCI-H23 


A a 

u.u 


I5rain (ietai ) 


u.z 


Lung ca. NCI-rl4oO 


a a 
U.U 


Brain (Hippocampus) Pool j 


a i 

U.l 


Lung ca. HUr-oZ 


a 1 
U.l 


cerebral cortex rooi 


u.x 


T una rn 


0 0 


Rmin f^iiViQtATvHfl nicrra^ PaaI 

Ul Alii 1 JUUdUUlUfl 111 a J X \J\Jl 


0.1 


Liver 


0.0 


Brain (Thalamus) Pool 


0.2 


Fetal Liver 


0.5 


Brain (whole) 


0.2 


Liver ca. HepG2 


1.7 


Spinal Cord Pool 


0.1 


Kidney Pool 


0.1 


Adrenal Gland 


0.0 


Fetal Kidney 


0.4 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


100.0 


Salivary Gland 


0.0 


Renal ca. A498 


18.9 


Thyroid (female) 


0.0 


Renal ca. ACHN 


59.0 


Pancreatic ca. CAPAN2 


0.0 


Renal ca. UO-31 


82.9 


Pancreas Pool 


0.1 



Table 11. HASS Panel vl.O 



Column A - Rel. Exp.(%) Ag821, Run 248122719 


Tissue Name 


A 


Tissue Name 


I A 


MCF-7 CI 


0.0 


U87-MGF1 (B) 


0.0 


MCF-7 C2 


0.0 


U87-MG F2 


0.0 


MCF-7 C3 


0.0 


U87-MGF3 


0.0 


MCF-7 C4 


0.0 


U87-MGF4 


0.0 


MCF-7 C5 


0.1 


U87-MG F5 


0.0 


MCF-7 C6 


0.0 


U87-MGF6 


0.0 


MCF-7 C7 


0.0 


U87-MG F7 


0.0 


MCF-7 C9 


0.0 


U87-MGF8 


0.0 


MCF-7 C10 


0.0 


U87-MGF9 


0.0 


MCF-7 CI 1 


0.0 


U87-MGF10 


0.0 


MCF-7 C12 


0.0 


U87-MGF11 


0.0 


MCF-7 C13 


0.0 


U87-MGF12 


0.0 


MCF-7 C15 


0.0 


U87-MGF13 


0.0 


MCF-7 C16 


0.0 


U87-MGF14 


0.0 


MCF-7 C 17 


0.0 


U87-MGF15 


0.0 


T24D1 


0.0 


U87-MGF16 


0.0 
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T74 F>? 


ft 0 
u.u 


TTR7-MCS F17 
uo/ ivivj rw 






0 0 

V/.v 


T nPAP A1 


00 

V.V/ 


T94 D4 


l/.U 


T nPAP AO 


0 0 

v/.v/ 




0 0 


T nPAP Al 


0 0 

v/.v/ 


TO A Ti(\ 


U.U 


T nPAP A4 


ft ft 

v/.v/ 


TO A TYI 


ft 0 


f nPAP AS 


0 0 

V/.V/ 


TO A no 
iz** ijy 


u.u 


T nPAP Afi 


0 0 

v/.v 


to a nin 

lZ*t L/1U 


0 ft 
u.u 


T nPAP A7 


ft 0 

V/.V/ 


TO A ni 1 


ft ft 

U.U 


T nPAP AR 


0 0 

v/.v 


TO a m 7 


ft ft 
u.u 


T nPAP AO 


ft ft 

V/.v 


toa rn ^ 


ft ft 
u.u 


T tiPAP A 1ft 


ft ft 


1/4 LI ID 


ft ft 


T t\C A P A 1 1 
LlJV/Ar All 


ft ft 


JLHO 


u.u 


T «PAP A 17 
JLIlUAr A1Z 


ft ft 
u.u 


T*)A Til 7 


ft ft 
u.u 


T nPAP A 1 *\ 
LIlL/A-r A JO 


ft ft 
u.u 


C A Pa\T "D 1 


u.u 


T nPAP A 14 


ft ft 

u.u 


L/AJraJN Jt5z 


U.U 


T tiP AP AK 
LIlls At AID 


ft ft 
u.u 


CAralN do 


U.U 


T nPAP A 1 A 
L.11L/A_r AID 


ft ft 
u.u 




n ft 
u.u 


T nPAP A 1 7 
LTlls Ar A 1 / 


ft ft 
u.u 




u.u 


Primaiy Astrocytes 


ft ft 
u.u 


V^/vraIN DO 


ft ft 
u.u 


ri unary ivcnai rTOAimai i uuuie npiuiciiai ecu t\M 


inn ft 

1UU.U 


PAPaltf T*7 


ft ft 
u.u 


Pritnan/ mplanA^vtpc A *\ 

x riindiy iiiciaiiuwy ico t\j 


0 0 


CAPaN B8 


0.0 


126443 - 341 medullo 


0.0 


CAPaN B9 


0.0 


126444- 487 medullo 


0.1 


CAPaN BIO 


0.0 


126445 -425 medullo 


0.1 


CAPaN Bll 


0.0 


126446 - 690 medullo 


0.2 


CAPaN B12 


0.0 


126447 - 54 adult glioma 


0.1 


CAPaN B13 


0.0 


126448 - 245 adult glioma 


0.1 


CAPaN B14 


0.0 


126449 -317 adult glioma 


0.0 


CAPaN B15 


0.0 


126450 - 212 glioma 


0.2 


CAPaN B16 


0.0 


126451 -456 glioma 


0.2 


CAPaN B17 


0.0 
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Table 12. Panel 1.2 



Column A - Rel. Ex 


>.(%) Ag821, Run 118348335 


l issue r^ame 


A 


tissue tune 


A 


Endothelial cells 


a a 
u.u 


rvenai ca. /oo-u 


OJ.J 


TT a. /C a 4„1\ 

Heart (retal; 


a a 
u.u 


Kenai ca. /vtyo 


74 7 


rancreas 




p pna 1 ra pYp WX 

jvenai ca. xvajt J7j 




rancreanc ca. cat ajn z 


a a 
u.u 


Kcnai ca. /vcitxn 


inn n 


Adrenal Gland 


A 0 

u.z 


Kenai ca. uu-ji 


70 7 


l nyroia 


a 7 
u. / 


P A na1 r>a TV 1A 

ivenai ca. i jv-iu 


71 7 


Salivary gland 


a a 

U.*f 


Liver 


0 7 


Pituitary gland 


A ^ 
U.j 


Liver (ietai j 


A 1 


brain (letal) 


A 1 

U.j 


T lira. s«n ill rtnnt r> Vi lac #* 1 Uani.J 

Liver ca. ^nepaioDiasij xiepvjz 


7 4. 


Brain (whole) 


A ^ 

U.J 


Lung 


A 1 

V.J 


Brain (amygdala) 


A 0 


L>ung (ieiai/ 


V.J 


Brain (cerebellum) 


A ^ 


mng ca. ^sman ceiij 


A 4 


Brain (hippocampus) 


A 3 
U.J 


T iin ff ra /'email /*#>1H "NIPT-l-lfiQ 

Lung ca. (smaii ceuj iN^i-noy 


v.t 


Brain (thalamus) 


A 1 
U.l 


T nnn r*a r»*»ll uar \ QT4P-77 

idling ca. ^s.ceii var.y onr- / / 


0 1 


cere oral cortex 


A 4 


Lung ca. ^loigc ceu^iNv^i-riHuv/ 


0 0 


Spinal cord 


A 9 
U.Z 


idling va. ^IMJI1~5>II1. VCll^ AJ*T7 




glio/astro Uo /-mu 


A A 

U.U 


Lung ca. \non-s.cejij inwi-hzj 


0 1 

V. 1 


guo/astro u-i l o-mlj 


A A 
U.U 


LrUng ca. ^non-s.ceiij nur-uz 


0 7 

v. / 


astrocytoma owi/w 


A A 
U.U 


mng ca. \non-s.ciy inv^i^iijzz 


0 7 


neuro , met dJv-im-Ao 


A 7 
U.Z 


Lung ca. (Squam.j o w yuu 


A 7 


aauuwjrUJiiici 01 JJ7 


0.1 


Lune ca fsauam ^ NCI-H596 


0.1 


astrocytoma SNB-75 


0.0 


Mammary gland 


0.3 


glioma SNB-19 


0.5 


Breast ca.* (pl.ef) MCF-7 


0.2 


glioma U251 


0.1 


Breast ca.* (pl.ef) MDA-MB-231 


0.1 


glioma SF-295 


0.0 


Breast ca.* (pi. ef) T47D 


0.3 


Heart 


0.6 


Breast ca. BT-549 


0.1 


Skeletal Muscle 


0.2 


Breast ca. MDA-N 


0.0 


Bone marrow 


0.1 


Ovary 


0.1 


Thymus 


0.1 


Ovarian ca. OVCAR-3 


0.1 


Spleen 


0.3 


Ovarian ca. OVCAR-4 


0.2 


Lymph node 


0.2 


Ovarian ca. OVCAR-5 


48.6 


Colorectal Tissue 


0.1 


Ovarian ca. OVCAR-8 


0.1 


Stomach 


0.3 


Ovarian ca. IGROV-1 


26.1 


Small intestine 


0.2 


Ovarian ca. (ascites) SK-OV-3 


3.9 
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Colon ra <JW48ft 


0 0 

V.V 


T TfpniQ 


0.1 


V/d.. lj W UZ.V TV tOv Illvt/ 




rlavCiiui 


0.8 


Colon ca. HT29 


0.5 


Prostate 


0.6 


Colon ca. HCT-116 


0.0 


Prostate ca.* (bone met) PC-3 


0.1 


Colon ca. CaCo-2 


59.0 


Testis 


1.1 


Colon ca. Tissue (OD03866) 


0.0 


Melanoma Hs688(A).T 


0.0 


Colon ca. HCC-2998 


0.2 


Melanoma* (met) Hs688(B).T 


0.0 


Gastric ca.* (liver met) NCI-N87 


0.2 


Melanoma UACC-62 


0.0 


Bladder 


0.8 


Melanoma Ml 4 


0.0 


Trachea 


0.1 


Melanoma LOXIMVI 


0.0 


Kidney 


2.9 


Melanoma* (met) SK-MEL-5 


0.0 


Kidney (fetal) 


0.0 







Table 13. Panel 3D 



Column A - Rel. Exp.(%) Ag821, Run 164729956 



Tissue Name 


A 


Tissue Name 


A 


Daoy- Medulloblastoma 


0.0 


Ca Ski- Cervical epidermoid carcinoma 
(metastasis) 


0.0 


TE671- Medulloblastoma 


0.0 


ES-2- Ovarian clear cell carcinoma 


0.0 


D283 Med- Medulloblastoma 


0.0 


Ramos- Stimulated with PMA/ionomycin 
6h 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


0.0 


Ramos- Stimulated with PMA/ionomycin 
14h 


0.0 


XF-498- CNS 


0.0 


MEG-01- Chronic myelogenous leukemia 
(megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.1 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


2.3 


SK-N-SH- Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- non-Hodgkiris B-cell lymphoma 


0.0 


Cerebellum 


0.1 


JM1- pre-B-cell lymphoma 


0.0 


Cerebellum 


0.0 


Jurkat- T cell leukemia 


0.0 


NCI-H292- Mucoepidermoid lung 
carcinoma 


0.0 


TF- 1 - Erythroleukemia 


0.0 


DMS-1 14- Small cell lung cancer 


0.0 


HUT 78- T-cell lymphoma 


0.0 


DMS-79- Small cell lung cancer 


0.3 


U937- Histiocytic lymphoma 


0.0 


NCI-H146- Small cell lung cancer 


0.0 


KU-812- Myelogenous leukemia 


0.0 


NCI-H526- Small cell lung cancer 


0.0 


769-P- Clear cell renal carcinoma 
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NCI-N417- Small cell lung cancer 


0.0 


Caki-2- Clear cell renal carcinoma 


18.8 


NCI-H82- Small cell lung cancer 


1.6 


SW 839- Clear cell renal carcinoma 


37.4 


NCI-H157- Squamous cell lung 
cancer (metastasis) 


A A 
U.U 


Rhabdoid kidney tumor 


A A 

U.U 


IMLl-ril 155- Large cell lung cancer 


A A 


Hs766T- Pancreatic carcinoma (LN 
metastasis) 


A 1 
U.I 


NLI-H1299- Large cell lung cancer 


a r\ 
U.U 


CAPAN-1- Pancreatic adenocarcinoma 
(liver metastasis) 


A A 

U.U 


NPI-H727- I une carcinoid 


1.9 


SU86.86- Pancreatic carcinoma (liver 
metastasis) 


0.3 


NCI-UMC-1 1- Lung carcinoid 


14.4 


BxPC-3- Pancreatic adenocarcinoma 


0.0 


LX-1- Small cell lung cancer 


0.0 


HPAC- Pancreatic adenocarcinoma 


0.0 


Colo-205- Colon cancer 


0.1 


MIA PaCa-2- Pancreatic carcinoma j 


0.0 


rvivii^- v^uiun L-oiiuci 


0 0 

v. v 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


4.4 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic epithelioid ductal 
carcinoma 


0.0 


NCI-H716- Colon cancer 


3.1 


T24- Bladder carcinma (transitional cell) 


0.0 


SW-48- Colon adenocarcinoma 


0.0 


5637- Bladder carcinoma 


0.0 


SW1 1 16- Colon adenocarcinoma 


0.0 


HT- 1 1 97- Bladder carcinoma 


5.6 


LS 174T- Colon adenocarcinoma 


0.0 


UM-UC-3- Bladder carcinma (transitional 
cell) 


0.0 


SW-948- Colon adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon adenocarcinoma 


0.0 


HT-1080- Fibrosarcoma 


0.0 


NCI-SNU-5- Gastric carcinoma 


0.0 


MG-63- Osteosarcoma 


0.0 


KATO III- Gastric carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma (vulva) 


0.0 


NCI-SNU-16- Gastric carcinoma 


A ft 

U.U 


SJRH30- Rhabdomyosarcoma (met to 
bone marrow) 


A A 
U.U 


NCI-SNU-1- Gastric carcinoma 


0.8 


A431- Epidermoid carcinoma 


0.0 


RF-1- Gastric adenocarcinoma 


0.0 


WM266-4- Melanoma 


0.0 


RF-48- Gastnc adenocarcinoma 


0.0 


DU 145- Prostate carcinoma (brain 
metastasis) 


0.0 


MKN-45- Gastric carcinoma 


11.3 


MDA-MB-468- Breast adenocarcinoma 


0.0 


NCI-N87- Gastric carcinoma 


0.0 


SCC-4- Squamous cell carcinoma of 
tongue 


0.0 


OVCAR-5- Ovarian carcinoma 


0.2 


SCC-9- Squamous cell carcinoma of 
tongue 


0.0 


RL95-2- Uterine carcinoma 


0.0 


SCC-15- Squamous cell carcinoma of 
tongue 


0.0 


HelaS3- Cervical adenocarcinoma 


0.0 


CAL 27- Squamous cell carcinoma of 
tongue 


0.0 



164 



WO 03/080856 



PCT/US03/08490 



Table 14. Panel 4D 



Column A - Rel. Ex 
Column B - Rel. Ex 


p.(%) Ag821, Run 145386336 
p.(%) Ag821, Run 145608640 


Tissue Name 


A 


B 


Tissue Name 


A 


B 


Secondary Thl act 


17.3 


18.2 


HUVEC IL-lbeta 


0.2 


0.1 


^eermdarv Thl act 


4 7 


7 1 


HT IVFP TFN onmma 


0 2 


0 6 


Secondary Trl act j 


12.7 


7.2 


HUVEC TNF alpha + IFN gamma 


0.1 


0.0 


Secondary Thl rest 


12.9 


14.2 


TTITirPO • 1 'VTT^ 1 1 • XT A 

HUVEC TNF alpha + IL4 


0.3 


0.2 


Secondary Th2 rest 


2.7 


2.7 


HUVEC IL-11 ; 


0.2 


0.3 


Secondary Trl rest 


5.9 


5.6 


Lung Microvascular EC none 


0.8 


0.7 


Primary Thl act 


4.5 


6.5 


Lung Microvascular EC TNFalpha + 
IL-lbeta 


0.3 


0.1 


Primary Th2 act 


2.8 


2.0 


Microvascular Dermal EC none 


1.2 


0.9 


Primary Trl act 

l lUUtll Y 111 Uvl 


5.7 


7.1 


Microsvasular Dermal EC TNFalpha 
+ IL-lbeta 


0.4 


0.4 


Primary Thl rest 


69.7 


54.0 


Bronchial epithelium TNFalpha + 
ILlbeta 


0.7 


0.4 


Primary Th2 rest 


23.7 


29.9 


Small airway epithelium none 


0.1 


0.1 


Primarv Trl rest 


5.4 


3.6 


Small airway epithelium TNFalpha + 
IL-lbeta 


2.7 


2.1 


CD45RA CD4 lymphocyte 
act 


0.7 


0.5 


Coronery artery SMC rest 


0.2 


0.1 


CD45RO CD4 lymphocyte 

opt 

awl 


16.2 


10.9 


Coronery artery SMC TNFalpha + 
IT -Iheta 


0.2 


0.3 


CD8 lymphocyte act 


0.5 


1.0 


Astrocytes rest 


1.0 


0.3 


Secondary CD8 lymphocyte 
rest 


0.9 


0.7 


Astrocytes TNFalpha + IL-lbeta 


1.1 


0.8 


Secondary CD8 lymphocyte 
act 


6.1 


6.8 


KU-8 1 2 (Basophil) rest 


0.3 


0.4 


CD4 lymphocyte none 


0.3 


0.5 


KU-812 (Basophil) PMA/ionomycin 


0.9 


0.9 


2ryThl/Th2/Trl anti-CD95 
CH11 


8.4 


7.9 


CCD 1 106 (Keratinocytes) none 


0.0 


0.1 


LAK cells rest 


5.2 


3.0 


CCD1 106 (Keratinocytes) TNFalpha 
+ IL-lbeta 


1.3 


0.1 


LAK cells IL-2 


9.1 


11.2 


Liver cirrhosis 


2.5 


2.0 


LAK cells IL-2+IL-12 


14.8 


13.0 


Lupus kidney 


100.0 


80.7 


LAK cells IL-2+IFN gamma 


12.7 


11.3 


NCI-H292 none 


0.0 


0.2 


LAK cells IL-2+ IL-18 


8.9 


7.5 


NCI-H292 IL-4 


0.6 


0.5 


LAK cells PMA/ionomycin 


6.6 


5.7 


NCI-H292 IL-9 


0.1 


0.1 
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NK Cells EL-2 rest 


11.1 


6.9 


NCI-H292 IL-13 


0.0 


0.0 


Two Way MLR 3 day 


11.0 


7.0 


NCI-H292 IFN gamma 


0.1 


0.0 


Two Way MLR 5 day 


1.5 


1.9 


HPAEC none 


0.5 


0.2 


Two Way MLR 7 day 


20.7 


3.4 


HPAEC TNF alpha + IL-1 beta 


0.4 


0.2 


PBMC rest 


1.5 


0.5 


Lung fibroblast none 


0.6 


0.3 


PBMC PWM 


22.8 


20.0 


Lung fibroblast TNF alpha + IL-1 
beta 


0.1 


0.0 


PBMC PHA-L 


9.6 


8.7 


Lung fibroblast IL-4 


0.4 


0.3 


Ramos (B cell) none 


0.2 


0.5 


Lung fibroblast IL-9 


0.1 


0.3 


Ramos (B cell) ionomycin 


0.9 


0.5 


Lung fibroblast IL-13 


0.7 


0.9 


B lymphocytes PWM 


3.5 


2.3 


Lung fibroblast IFN gamma 


0.2 


0.4 


B lymphocytes CD40L and 
IL-4 


v.O 




LJCllUdl nuiODlaSl V^I^L/1U/U ICSl 


u.z 




EOL-1 dbcAMP 


0.1 


0.2 


Dermal fibroblast CCD1070 TNF 
alpha 


12.1 


8.5 


EOL-1 dbcAMP 

x ivltv luiiuiiiy win 


0.4 


0.2 


Dermal fibroblast CCD1070 IL-1 

hpta 


0.1 


0.4 


Dendritic cells none 


6.0 


6.6 


Dermal fibroblast IFN gamma 


0.2 


0.2 


Dendritic cells LPS 


6.3 


4.3 


Dermal fibroblast IL-4 


0.1 


0.2 


Dendritic cells anti-CD40 


13.1 


6.1 


IBD Colitis 2 


0.5 


0.1 


Monocytes rest 


0.8 


0.6 


IBD Crohn's 


0.1 


0.1 


Monocytes LPS 


7.5 


3.9 


Colon 


0.2 


0.6 


Macrophages rest 


15.7 


15.2 


Lung 


0.7 


0.6 


Macrophages LPS 


7.6 


7.5 


Thymus 


97.9 


100.0 


HUVEC none 


0.1 


0.2 


Kidney 


4.7 


1.8 


HUVEC starved 


0.7 


1.4 









Table 15. Panel 5 Islet 



Column A - ReL Exp.(%) Ag821, Run 268362824 


Tissue Name 


A 


Tissue Name 


A 


97457_Patient-02go_adipose 


0.0 


94709_Donor 2 AM - A_adipose 


0.0 


97476_Patient-07sk_skeletal muscle 


0.0 


94710_Donor 2 AM - B_adipose 


0.0 


97477_Patient-07ut_uterus 


0.1 


9471 l_Donor 2 AM - C_adipose 


0.0 


97478_Patient-07pl_placenta 


0.0 


94712_Donor 2 AD - A_adipose 


0.1 


99 167_Bayer Patient 1 


0.0 


94713_Donor 2 AD - B adipose 


0.2 


97482 Patient-08ut uterus 


0.1 


94714_Donor 2 AD - C_adipose 


0.1 


97483_Patient-08pl__placenta 


0.0 


94742_Donor 3 U - A_Mesenchymal 
Stem Cells 


0.0 


97486 Patient-09sk skeletal muscle 


0.0 


94743_Donor 3 U - B_Mesenchymal 


0.0 
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1 


i 


Stem Cells 




97487 Patient-09ut uterus 


0.0 


94730_Donor 3 AM - A_adipose 


0.0 


97488_Patient-09pl_placenta 


0.0 


94731_Donor 3 AM - B_adipose 


0.0 


97492 Patient-lOut uterus 


0.1 


94732_Donor 3 AM - C_adipose 


0.0 


97493 J>atient-10pl_placenta 


0.0 


94733_Donor 3 AD - A_adipose 


0.2 


97495_Patient-l lgo_adipose 


0.0 


94734_Donor 3 AD - B_adipose 


0.1 


97496 Patient- llsk skeletal muscle 


0.0 


94735_Donor 3 AD - C_adipose 


0.1 


97497 Patient-llut uterus 


0.1 


77138_Liver_HepG2untreated 


9.5 


97498_Patient-l lpl_placenta 


0.0 


73556 Heart Cardiac stromal cells 
(primary) 


0.0 


97500_Patient- 1 2go_adipose 


0.0 


81735 Small Intestine 


0.5 


97501_Patient-12sk_skeletal muscle 


0.0 


72409_Kidney_Proximal Convoluted 
Tubule 


82.9 


97502 PatienM2ut uterus 


0.1 


82685 Small intestine Duodenum 


0.0 


97503J > atient-12pl_placenta 


0.0 


90650 Adrenal Adrenocortical adenoma 


0.0 


94721_Donor 2 U - A_Mesenchymal 
Stem Cells 


0.0 


72410_Kidney_HRCE 


100.0 


94722_Donor 2 U - B_Mesenchymal 
Stem Cells 


0.0 


72411_Kidney_HRE 


70.7 


94723_Donor 2 U - C_Mesenchymal 
Stem Cells 


0.0 


73139_Uterus_Uterine smooth muscle 
cells 


0.0 



Table 16. general oncology screening panel_v_2,4 



Column A - Rel. Exp.(%) Ag821, Run 258052111 
Column B - Rel. Exp.(%) Ag821, Run 258680990 
Column C - Rel. Exp.(%) Ag821, Run 259733172 


Tissue Name 


A 


B 


c 


Tissue Name 


A 


B 


c 


Colon cancer 1 


0.1 


0.1 


0.1 


Bladder NAT 2 


0.0 


0.0 


0.0 


Colon NAT 1 


0.0 


0.0 


0.0 


Bladder NAT 3 


0.0 


0.0 


0.0 


Colon cancer 2 


0.0 


0.0 


0.0 


Bladder NAT 4 


0.0 


0.0 


0.0 


Colon NAT 2 


0.1 


0.1 


0.1 


Prostate adenocarcinoma 1 


0.1 


0.2 


0.2 


Colon cancer 3 


0.2 


0.2 


0.2 


Prostate adenocarcinoma 2 


0.0 


0.0 


0.0 


Colon NAT 3 


0.6 


0.7 


0.7 


Prostate adenocarcinoma 3 


0.0 


0.0 


0.0 


Colon malignant cancer 4 


0.1 


0.0 


0.0 


Prostate adenocarcinoma 4 


0.1 


0.0 


0.6 


Colon NAT 4 


0.2 


0.2 


0.3 


Prostate NAT 5 


0.0 


0.0 


0.0 


Lung cancer 1 


0.0 


0.1 


0.1 


Prostate adenocarcinoma 6 


0.0 


0.0 


0.0 


Lung NAT 1 


0.0 


0.0 


0.0 


Prostate adenocarcinoma 7 


0.0 


0.0 


0.0 


Lung cancer 2 


3.7 


1.9 


2.7 


Prostate adenocarcinoma 8 


0.0 


0.0 


0.0 


Lung NAT 2 


0.0 


0.0 


0.0 


Prostate adenocarcinoma 9 


0.2 


0.0 


0.1 
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OlJUallluUa i/CXl taiuiiiuiiia J 


0 0 


00 

v.v 


0 1 


Pmctnfp "MAT 10 


0 0 

V/.V/ 


0.0 


0.0 


Lung NAT 3 


0.0 


0.0 


0.0 


Kidney cancer 1 


6.9 


2.6 


4.5 


Metastatic melanoma 1 


i.i 


0.1 


0.1 


Kidney NAT 1 


2.8 


2.1 


2.5 


Melanoma 2 


0.0 


0.0 


0.0 


Kidney cancer 2 


100.0 


100.0 


100.0 


Melanoma 3 


0.0 


0.0 


0.0 


Kidney NAT 2 


1.8 


0.9 


1.3 


Metastatic melanoma 4 


0.2 


0.1 


0.2 


Kidney cancer 3 


66.9 


48.0 


57.4 


Metastatic melanoma 5 


0.4 


0.2 


0.3 


Kidney NAT 3 


0.5 


0.2 


0.3 


Bladder cancer 1 


0.0 


0.0 


0.0 


Kidney cancer 4 


23.0 


11.7 


22.7 


Bladder NAT 1 


0.0 


0.0 


0.0 


Kidney NAT 4 


1.2 


0.9 


1.1 


Bladder cancer 2 


0.0 


0.0 


0.0 









Table 17. Ardais Kidney 1.0 



Column A - Rel. Exp.(%) 


Ag821, Run 343519949 


Tissue Name 


A 


Tissue Name 


A 


Kidney NAT( 1 0DA) 


2.6 


Kidney cancei(l 0C0) 


13.2 


Kidney NAT(10B1) 


6.2 


Kidney cancer(10D0) 


94.0 


Kidney NAT(1 ODE) 


0.6 


Kidney cancer(10C8) 


0.1 


Kidney NAT(1 ODD) 


12.0 


Kidney cancer(10B4) 


0.5 


Kidney NAT(10DC) 


3.8 


Kidney NAT(10C5) 


3.6 


Kidney NAT(IODB) 


9.6 


Kidney cancei(l 0C4) 


45.7 


Kidney NAT(10D9) 


4.9 


Kidney NAT(10C3) 


4.4 


Kidney cancer(10D3) 


1.4 


Kidney cancer(10C2) 


45.4 


Kidney cancer(10CC) 


17.2 


Kidney NAT(10BF) 


4.8 


Kidney cancer(10D8) 


11.3 


Kidney cancer(10BE) 


0.1 


Kidney cancer(10CF) 


0.2 


Kidney NAT(10BD) 


4.8 


Kidney cancer(10D6) 


0.2 


Kidney cancer(10BC) 


39.8 


Kidney cancer(10CE) 


8.7 


Kidney NAT(10B9) 


0.8 


Kidney cancet(10D5) 


0.0 


Kidney cancer(10B8) 


100.0 


Kidney cancer(10CD) 


8.5 


Kidney NAT(10B7) 


3.0 


Kidney cancet(10D4) 


1.2 


Kidney cancer(10B6) 


0.5 


Kidney cancer(10CB) 


20.4 


Kidney NAT(10AD) 


0.2 


Kidney cancer(10D2) 


21.9 


Kidney cancer(10AC) 


0.7 


Kidney cancer(10CA) 


19.3 


Kidney NAT(10AB) 


1.0 


Kidney cancer(10Dl) 


33.4 


Kidney cancei(10AA) 


| 10.0 


Kidney cancer(10C9) 


24.3 


Kidney NAT(10A9) 


1.1 


Kidney cancer(10C6) 


0.1 


Kidney cancer(10A8) 


1 0.1 
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AI_comprehensive panel_vl.O Summary: The expression of the transcript is 
widespread at moderate to low levels with higher expression detected in lung tissue from 
patients with emphysema (CT^Ol) as compared to expression in normal lung (CTs=34- 
37). 

5 Ardais Panel 1.1 Summary: Ag821 Highest expression of this gene is seen in a 

lung cancer sample (CT=27.4). This expression is consistent with the expression seen in 
Ardais Panel vl .0. Please see that panel for futher discussion of this gene in lung cancer. 

Ardais Panel v.1.0 Summary: Ag821 Highest expression of this gene is seen in a 
lung cancer sample (CT=29.2). In addition, this gene is consistently over-expressed in 
10 lung cancer when compared to matched normal tissue samples. Thus, expression of this 
gene could be used to differentiate between the lung cancer samples and other samples on 
this panel, and as a marker of lung cancer. Furthermore, therapeutic modulation of the 
expression or function of this gene or gene product may be useful in the treatment of lung 
cancer. 

15 CNS_neurodegeneration_vl.O Summary: Ag821 This experiment confirms the 

expression of the CG93088-01 gene at low levels in the brain in an independent group of 
individuals. This gene appears to be up-regulated in the temporal cortex of Alzheimer's 
disease patients when compared with non-demented controls. Thus, therapeutic 
modulation of the expression or function of this gene or gene product can slow or stop the 

20 progression of Alzheimer's disease. 

General screening panel vl. 6 Summary: Ag821 This gene is highly and 
specifically expressed by kidney cancer cell lines, suggesting that this gene product can be 
an effective kidney cancer marker. High levels of expression are also seen in a single 
colon cancer cell line, ovarian cancer cell lines and a lung cancer cell line. Thus, 

25 modulation of the expression or function of this gene can be useful in the treatment of 
kidney cancer. 

HASS Panel vl.O Summary: Ag821 Detectable levels of expression are limited 
to a kidney derived sample, consistent with expression in other panels. 

Panel 1.2 Summary: Ag821 Expression in this panel is in agreement with Panel 
30 1 .6. Please see that panel for discussion of this gene. 
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Panel 3D Summary: Ag821 Expression of this gene is limited to cell lines from 
renal, gastric, lung, pancreatic, bladder, myeloma, and colon cancers. 

Panel 4D Summary: Ag821 Two experiments with the same probe and primer set 
produce results that are in excellent agreement. The transcript is highly expressed in 
5 thymus and lupus kidney (CTs=27-28) but not in control kidney tissues or in normal 
colon. It is also induced in resting primary Thl and Th2 cells, but is expressed in lower 
levels in naive T cells and in chronically activated T cells. The resting primary T cell RNA 
comes from cultures of T cells, purified from cord blood, stimulated with CD3 and CD28 
monoclonal antibodies in the presence of either IL-12 (Thl cultures) or IL-4 (Th2 

10 cultures). These cultures contain many T cells that have not yet committed to the Thl or 
Th2 pathway (ThO cells). Thus, humanized monoclonal antibodies blocking Th2 effector T 
cell function would decrease inflammation associated with asthma, lupus and emphysema. 

Panel 5 Islet Summary: Ag821 Detectable levels of expression are limited to 
kidney derived samples (CTs=26), consistent with other panels. 

1 5 general oncology screening panel_v_2.4 Summary: Ag82 1 Very high levels of 

expression of this gene are seen in kidney cancer (CTs=24-25). Furthermore, this gene is 
more highly expressed in kidney cancer than in the corresponding normal adjacent tissue. 
Thus, expression of this gene could be used as a marker of this cancer. Furthermore, 
therapeutic modulation of the expression or function of this gene product can be useful in 

20 the treatment of kidney cancer. 

Ardais Kidney 1.0 Ag821 Highest expression of this gene is seen in a kidney 
cancer sample. Furthermore, this gene is consistently expressed at higher levels in kidney 
cancer than in the corresponding normal adjacent tissue, in agreement with previous 
panels. Thus, expression of this gene could be used as a marker of this cancer. 

25 Furthermore, therapeutic modulation of the expression or function of this gene product can 
be useful in the treatment of kidney cancer. 

B. CG51373-01:Nephrin 

Expression of gene CG5 1373-01 was assessed using the primer-probe sets Ag271b, 
described in Table 1 8. Results of the RTQ-PCR runs are shown in Table 19 and Table 20. 
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Table 18. Probe Name Ag271b 



Primers 


Sequences 


Length 


Start 
Position 


seq n> 

No 


Forward 


5 ' -caccgtgagccaactgcttat-3 ' 


21 


792 


36 


Probe 


TBT-5 1 -agacacgccctatgtccaggtccg-3 1 - 
TAMRA 


24 


821 


37 


Reverse 


5 1 -ttcgttcatgcttcggcaa-3 ' 


19 


849 


38 



Table 19. Panel 2.2 



Column A - Rel. Exp.(%) Ag271b, Run 175148876 


Tissue Name 


A 


Tissue Name 


A 


Normal Colon 


11*7 

13.7 


Kidney Margin (OD04348) 


100.0 


Colon cancer (OD06064) 


A C A 

45.4 


Kidney malignant cancer (UDUo2u4B) 


6.1 


Colon Margin (OD06064) 


6.7 


Kidney normal adjacent tissue 
(OD0o204b) 


12.3 


L'Oion cancer [KjiJvoijy) 


1 7 
1. / 


jviQney cancer \\jij\j*r*r ju-u i ) 


AR R 


Colon Margin (OD06159) 


7.0 


Kidney Margin (OD04450-03) 


24.0 


Colon cancer (OD06297-04) 


5.3 


Kidney Cancer 8120613 


0.0 


Colon Margin (OD06297-05) 


7.4 


Kidney Margin 8120614 


11.7 


CC Gr.2 ascend colon (OD03921) 


10.5 


Kidney Cancer 9010320 


15.5 


CC Margin (OD03921) 


13.3 


Kidney Margin 9010321 


16.6 


Colon cancer metastasis (OD06104) 


1.8 


Kidney Cancer 8120607 


70.7 


Lung Margin (OD06104) 


3.0 


Kidney Margin 8120608 


11.4 


Colon mets to lung (OD04451-01) 


21.0 


Normal Uterus 


36.6 


Lung Margin (OD04451-02) 


10.0 


Uterine Cancer 06401 1 


11.7 


Normal Prostate 


2.0 


Normal Thyroid 


1.6 


Prostate Cancer (OD04410) 


3.9 


Thyroid Cancer 064010 


10.2 


Prostate Margin (OD04410) 


10.3 


Thyroid Cancer A302152 


9.3 


Normal Ovary 


78.5 


Thyroid Margin A302153 


4.9 


Ovarian cancer (OD06283-03) 


14.3 


Normal Breast 


31.4 


Ovarian Margin (OD06283-07) 


16.3 


Breast Cancer (OD04566) 


3.1 


Ovarian Cancer 064008 


41.8 


Breast Cancer 1024 


0.0 


Ovarian cancer (OD06145) 


36.6 


Breast Cancer (OD04590-01) 


10.2 


Ovarian Margin (OD06145) 


33.7 


Breast Cancer Mets (OD04590-03) 


19.1 


Ovarian cancer (OD06455-03) 


6.0 


Breast Cancer Metastasis (OD04655- 
05) 


6.0 


Ovarian Margin (OD06455-07) 


18.7 


Breast Cancer 064006 


14.2 
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Normal Lung 


20.7 


Breast Cancer 9100266 


12.5 


Invasive poor diff. lung adeno 
(ODO4945-01 


3.1 


Breast Margin 9100265 \ 


11.4 


Lung Margin (ODO4945-03) 


10.6 


Breast Cancer A209073 


15.7 


Lung Malignant Cancer (OD03126) 


5.8 


Breast Margin A2090734 


48.6 


Lung Margin (OD03126) 


10.4 


Breast cancer (OD06083) 


14.3 


Lung Cancer (OD05014A) 


6.2 


Breast cancer node metastasis 

(CXVsS\fS\RV\ 
\yjlJ\J\j\Jop ) 


15.6 


Lung Margin lhd ) 




iNOimai i^iver 


*T.J 


Lung cancer ^vjiajouo i ) 




i^iver cancer iuzo 


1Z. J 


Lung Margin (ULjuoUo l j 


A 1 
4.1 


Liver cancer iuzj 


19 ^ 


Lung Cancer (OD04237-01) 


3.2 


Liver Cancer 6004-T 


12.4 


Lung Margin (OD04237-02) 


37.9 


Liver Tissue 6004-N 


3.0 


Ocular Melanoma Metastasis 


18.8 


Liver Cancer 6005-T 


21.0 


Ocular Melanoma Margin (Liver) 


8.3 


Liver Tissue 6005-N 


26.2 


Melanoma Metastasis 


40.1 


Liver Cancer 064003 


2.5 


Melanoma Margin (Lung) 


13.5 


VT- 1 I51«JJ«. 

Normal Bladder 


y.i 


Normal Kidney 


16.0 


Bladder Cancer 1023 


8.4 


Kidney Ca, Nuclear grade 2 (OD04338) 


57.0 


Bladder Cancer A302173 


11.0 


Kidney Margin (OD04338) 


12.9 


Normal Stomach 


22.8 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


73.2 


Gastric Cancer 9060397 


9.9 


Kidney Margin (OD04339) 


14.2 


Stomach Margin 9060396 


6.6 


Kidney Ca, Clear cell type (OD04340) 


20.0 


Gastric Cancer 9060395 


11.8 


Kidney Margin (OD04340) 


35.1 


Stomach Margin 9060394 


23.7 


Kidney Ca, Nuclear grade 3 (OD04348) 


25.0 


Gastric Cancer 064005 


6.1 



Table 20. Panel 4.1D 



Column A - Rel. Exp.(%) Ag271b, Run 174261189 


Tissue Name 


A 


Tissue Name 


A 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


21.0 


Secondary Th2 act 


0.4 


HUVEC DFN gamma 


19.2 


Secondary Trl act 


0.3 


HUVEC TNF alpha + IFN gamma 


26.6 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + IL4 


32.5 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


11.8 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


43.2 


Primary Thl act 


0.0 


Lung Microvascular EC TNFalpha + IL- 
lbeta 


34.4 
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Primary Th2 act 


0.0 


Microvascular Dermal EC none 


35.8 


Primary Trl act 


0.6 


Microsvasular Dermal EC TNFalpha + IL- 
lbeta 


26.4 


Primary Thl rest j 


0.3 


Bronchial epithelium TNFalpha + ILlbeta 


32.3 


Primary Th2 rest 


0.0 


Small airway epithelium none 


16.8 


Primary Trl rest 


0.0 


Small airway epithelium TNFalpha + IL- 
lbeta 


33.2 


CD45RA CD4 lymphocyte act 


33.9 


Coronery artery SMC rest 


47.3 


CD45RO CD4 lymphocyte act 


0.0 


Coronery artery SMC TNFalpha + IL-lbeta 


51.8 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


59.0 


Secondary CD8 lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL-lbeta 


38.7 


Secondary CD8 lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.6 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) PMA/ionomycin 


1.0 


2ry Thl/Th2/Trl_anti-CD95 
CH11 


0.0 


CCD 11 06 (Keratinocytes) none 


36.1 


LAK cells rest 


0.1 


CCD1 106 (Keratinocytes) TNFalpha + DL- 

1 Koto 

ioeta 


30.6 


LAK cells 1L-Z 


u.u 


Liver cirrhosis 


A A 


LAK cells IL-Z-rLL- 1 L 


A A 

u.u 


VtV^T inoi n/ v nA 

JNui-rizyz none 


17 7 


LAK cells IL-2+lrN gamma 


A A 
U.U 




1 1 .0 


T AVr»/a11e«TT IX TT 10 

LAK ceUS IL-z*r lL,-lo 


A A 
U.U 


"MY^T T-I707 TT O 


1 7 1 


LAK cells PMA/ionomycin 


A 1 
U. 1 


xjpi T4707 TT 1 1 


1 7 Q 


XTV /^aIIc TT O roof 

nk ceils il-z rest 


A A 
U.U 


XTPI T4707 TTTM rramma 

iN^i-rizyz triN gamma 


17 R 


i wo way mlk 3 oay 


A A 
U.U 


xir/vtiL> none 


1 7 ft 


i wo way mlk j oay 


A 1 
U. 1 


rtrAiiL^ i iNr aipna t- jx,- i oeia 


1Q ft 

J7.0 


Two Way MLR 7 day 


0.3 


Lung fibroblast none 


71.7 


PBMC rest 


0.0 


Lung fibroblast TNF alpha + IL-1 beta 


55.5 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


66.4 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


100.0 


Ramos (B cell) none 


0.0 


Lung tiDroblast IL-1 J 




Ramos (B cell) ionomycin 


0.0 


Lung fibroblast BFN gamma 


70.7 


B lymphocytes PWM 


0.0 


Dermal fibroblast CCD1070 rest 


66.0 


B lymphocytes CD40L and IL-4 


0.3 


Dermal fibroblast CCD1070 TNF alpha 


61.1 


EOL-1 dbcAMP 


0.3 


Dermal fibroblast CCD1070 IL-1 beta 


61.6 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


43.2 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


59.0 


Dendritic cells LPS 


0.6 


Dermal Fibroblasts rest 


36.6 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


2.0 
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Monocytes rest 


0.0 


Neutrophils rest 


6.7 


Monocytes LPS 


0.0 


Colon 


4.4 


Macrophages rest 


0.0 


Lung 


20.4 


Macrophages LPS 


0.0 


Thymus 


11.6 


HUVEC none 


18.2 


Kidney 


26.6 


HUVEC starved 


27.0 







Panel 2.2 Summary: Ag271b This gene is significantly over-expressed in kidney 
cancer, melanoma and colon cancer when compared to normal adjacent tissue. Thus, 
expression of this gene could be used to differentiate between these samples and other 

5 samples on this panel and as a marker for these cancers. Furthermore, therapeutic targeting 
of CG5 1373-01 is anticipated to limit or block the extent of tumor cell migration and 
invasion and tumor metastasis, in these tumors. 

Panel 4.1D Summary: Ag271b The CG5 1373-01 gene, which encodes the 
extracellular domain of an immunoglobin domain containing membrane protein, is 

10 expressed in panel 4. ID in the following resting and cytokine-activated cells and tissues: 
HUVEC, lung microvascular endothelial cells, small airway epithelium, coronary artery 
smooth muscle cells, astrocytes, lung fibroblasts, and dermal fibroblasts. The CG51373-01 
gene product can be useful as a target for therapeutic antibodies which antagonize the 
function of the Ig domain-containing protein. Such antibodies can reduce or eliminate the 

IS symptoms in patients with inflammatory diseases and autoimmune diseases, such as 
multiple sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, 
rheumatoid arthritis, or psoriasis. 

Example 8. Relevant pathways 

20 PathCalUng™ Technology: The sequence of Acc. No CG57008-02 was derived 

by laboratory screening of cDNA library by the two-hybrid approach. cDNA fragments 
covering either the full length of the DNA sequence, or part of the sequence, or both, were 
sequenced. In silico prediction was based on sequences available in CuraGen 
Corporation's proprietary sequence databases or in the public human sequence databases, 

25 and provided either the full-length DNA sequence, or some portion thereof. 



174 



WO 03/080856 



PCT/US03/08490 



cDNA libraries were derived from various human samples representing multiple 
tissue types, normal and diseased states, physiological states, and developmental states 
from different donors. Samples were obtained as whole tissue, primary cells or tissue 
cultured primary cells or cell lines. Cells and cell lines may have been treated with 
5 biological or chemical agents that regulate gene expression, for example, growth factors, 
chemokines or steroids. The cDNA thus derived was then directionally cloned into the 
appropriate two-hybrid vector (Gal4-activation domain (Gal4-AD) fusion). Such cDNA 
libraries (as well as commercially available cDNA libraries from Clontech (Palo Alto, 
CA)) were then transferred from Exoli into a CuraGen Corporation proprietary yeast 

10 strain (disclosed in U. S. Patents 6,057,101 and 6,083,693, incorporated herein by 
reference in their entireties). 

Gal4-binding domain (Gal4-BD) fusions of a CuraGen Corportion proprietary 
library of human sequences was used to screen multiple Gal4-AD fusion cDNA libraries 
resulting in the selection of yeast hybrid diploids in each of which the Gal4-AD fusion 

1 5 contains an individual cDNA. Each sample was amplified using the polymerase chain 
reaction (PCR) using non-specific primers at the cDNA insert boundaries. Such PCR 
product was sequenced; sequence traces were evaluated manually and edited for 
corrections if appropriate. cDNA sequences from all samples were assembled together, 
sometimes including public human sequences, using bioinformatic programs to produce a 

20 consensus sequence for each assembly. Each assembly is included in CuraGen 

Corporation's database. Sequences were included as components for assembly when the 
extent of identity with another component was at least 95% over 50 bp. Each assembly 
represents a gene or portion thereof and includes information on variants, such as splice 
foims single nucleotide polymorphisms (SNPs), insertions, deletions and other sequence 

25 variations. 

Physical clone: the cDNA fragment derived by the screening procedure, covering 
the entire open reading frame is, as a recombinant DNA, cloned into pACT2 plasmid 
(Clontech) used to make the cDNA library. The recombinant plasmid is inserted into the 
host and selected by the yeast hybrid diploid generated during the screening procedure by 
30 the mating of both CuraGen Corporation proprietary yeast strains N106' and YULH (U. S. 
Patents 6,057,101 and 6,083,693). 
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Interacting protein pairs are added to CuraGen's PathCal ling™ Protein Interaction 
Database. This database allows for the discovery of novel pharmaceutical drug targets by 
virtue of their interactions and/or presence in pathologically related signaling pathways. 
Protein interactions are subsequently analyzed using bioinformatic tools within 

5 GeneScape™, which provides a means of visualization of binary protein interactions, 

protein complex formation, as well as complete cellular signaling pathways. Specifically, 
as shown in Figure 1, the sequences, which encode proteins CG57008-02 (HAVcR-1) 
(NOVlc), CG51373 (NEPHl)(NOV2), LOC15546, and COL1 Al proteins were found to 
interact and can result in the formation of a protein complex, or may constitute a series of 

10 complexes, which form in order to propagate a cellular signal, which is physiologically 
relevant to a disease pathology. The specific interactions, which constitute the specific 
complexes, may also be useful for therapeutic intervention through the use of recombinant 
protein or antibody therapies, small molecule drugs, or gene therapy approaches. 

Protein interactions, which are identified through the mining of the PathCalling™ 

1 5 database, can be screened in vitro and in vivo to provide expression, functional, 

biochemical, and phenotypic information. Assays may be used alone or in conjunction 
and include, but are not limited to the following technologies; RTQ-PCR, transfection of 
recombinant proteins, co-immunoprecipitation and mass spectrometry, FRET, affinity 
chromatography, immunohistochemisty or immunocytochemistry, gene CHIP 

20 hybridizations, antisense (Le. knock-down, knock-up), GeneCalling experiments, and/or 
biochemical assays (phosphorylation, dephosphorylation, protease, etc.). 

As shown in Figure 1, PathCalling data shows that the mucin domain of the protein 
encoded by CG57008-02 (HAVcr-1) interacts with CG51373, nephrin-like I (NEPH1), a 
structural-cell adhesion molecule. Table 21 summarizes the amino acid sequences of the 

25 bait and prey used in three independent experiments to detect this novel interaction. This 
interaction suggests that CG51373 acts as a membrane-bound attractant for the CG57008- 
02 protein and that dysregulated signaling events promote kidney tumorigenesis. 
Expression data from RTQ-PCR Panel 2 (see Tables 16 and 19) show that both genes are 
over-expressed in kidney cancer when compared to expression in normal adjacent tissue, 

30 providing further support for this novel interaction. 

While mucins are highly O-glycosylated in mammalian cells, this interaction was 

discovered using the yeast two-hybrid system and therefore the mucin domain is not 
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expected to be O-glycosylated. Thus, this discovery may reflect an interaction that only 

occurs in the disease state, as mucin is known to be abnormally glycosylated in tumors. 

Figure 1 also shows that CG57008-02 interacts with LOCI 55465, a secreted protein that 

may play a role as a chemoattractant in organ development and is upregulated in some 
5 tumors (PMID: 9790916), and COL1A1, an extracellular matrix protein maybe relevant 

These interactions provide greater support and context for the novel CG57008-02 and 

CG5 1372 interaction. 

This novel interaction also suggests that disruption of the interaction between 

CG57008-02 and CG51373 can prevent the peripheral feedback immune response that 
10 leads to asthma. In addition, RTQ-PCR Panel 4. 1 D (see Table 20) shows high levels of 

expression of CG51373 in cytokine-activated lung fibroblasts. 



The use of antibodies to disrupt this interaction can be useful in the treatment of 
kidney cancers and asthma. 

Table 21. Yeast Two-hybrid Interaction Information 



Interaction 
Frame 


HAVCR1 Interaction 
Domain (aa) 


Nephrinl Interaction 
Domain (aa) 


Number of Yeast Colonies 
Observed 


U+) 


Bait:86-270 


Prey: 159-492 


1 


K+) 


Bait:86-270 


Prey: 159-492 


5 


1(+) 


Bait:86-270 


Prey: 159-491 


1 



15 

Domain analysis by PathCalling 1x1 Matrix Assay 

Table 22. Fragments of CG57008 and CG51373 tested in Matrix lxl assay 



CG57008 


Fragment 
(AA boundary) 


CG51373 


Fragment 
(AA boundary) 


CG57008-03 


21-60 


CG5 1373-07 


1-429 


CG57008-03 


21-110 


CG5 1373-07 


22-58 


CG57008-03 


21-130 


CG5 1373-07 


22-58 


CG57008-03 


21-170 


CG5 1373-07 


22-112 


CG57008-03 


21-210 


CG5 1373-07 


22-145 


CG57008-03 


21-240 


CG5 1373-07 


22-205 
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CG57008-03 


21-290 


CG5 1373-07 


22-255 


CG57008-03 


60-290 


CG5 1373-07 


22-317 


CG57008-03 


106-290 


PfiS 1373-07 




CG57008-03 


131-210 


CfiS 1 373-07 


22-429 


CG57008-03 


131-290 


CG5 1373-07 


51-429 


CG57008-03 


170-290 


CG5 1373-07 


106-429 


CG57008-03 


209-290 


CG5 1373-07 


159-205 


CG57008-03 


240-290 


CG51373-07 


159-255 






CG51373-07 


159-317 






CG5 1373-07 


159-355 






CG5 1373-07 


159-429 






CG5 1373-07 


200-429 






CG5 1373-07 


255-429 






CG5 1373-07 


300-429 






CG5 1373-07 


340-429 






CG5 1373-09 


159-255 






CG5 1373-09 


159-317 



Table 23. Interactions Detected in the matrix lxl assay 



Binding Domain 
(BD) Fusion 
Protein 


Amino 
Acids in 
the BD 
Fusion 
Protein 


Domain Description for BD 
Fusion Protein 


Activation Domain 
(AD) Fusion Protein 


Amino 
Acids in 
the AD 
Fusion 
Protein 


Domain Description 
for AD Fusion Protein 


CG57008-03 
(CR014) 


106-290 


Extracellular domain minus Ig 
domain, contains mucin 
domain 


CG51373-07 
(CR016) 


159-255* 


PKD domain 


CG57008-03 
(CR014) 


106-290 


Extracellular domain minus Ig 
domain, contains mucin 
domain 


CG5 1373-07 
(CR016) 


159-205* 


Partial PKD domain 


CG570O8-03 
(CR014) 


106 - 290 


Extracellular domain minus Ig 
domain, contains mucin 
domain 


CG5 1373-09 
(CR016) 


159-255 


PKD domain 


CG57008-03 
(CR014) 


60 - 290 


Extracellular domain that 
includes mucin domain and 
partial Ig domain 


CG5 1373-07 
(CR016) 


159-255* 


PKD domain 


CG57008-03 
(CR014) 


60 - 290 


Extracellular domain that 
includes mucin domain and 
partial Ig domain 


CG5 1373-07 
(CR016) 


159-205* 


Partial PKD domain 


CG57008-03 
(CR014) 


60 - 290 


Extracellular domain that 
includes mucin domain and 
partial Ig domain 


CG5 1373-09 
(CR016) 


159-255 


PKD domain 


CG5 1373-07 
(CR016) 


22 - 255* 


First set of 3 Ig domains + 
PKD domain 


CG57008-03 
(CR014) 


106-290 


Extracellular domain 
minus Ig domain, 
contains mucin domain 


CG5 1373-07 
(CR016) 


159-317 


PJCD domain plus 4th Ig 
domain 


CG57008-03 
(CR014) 


106-290 


Extracellular domain 
minus Ig domain, 
contains mucin domain 


CG5 1373-09 
(CR016) 


159-255 


PKD domain 


CG57008-03 
(CR014) 


106 - 290 


Extracellular domain 
minus Ig domain, 
contains mucin domain 
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Extracellular domain 












that includes mucin 


CG51373-09 






CG57008-03 




domain and partial Ig 


(CR016) 


159-255 


PKD domain 


(CR014) 


60-290 


domain 



Table 22 shows all the combinations and variants of CG57008 and CG51373 that 
were tested for interactions. These interactions were tested in both orientations with 
respect to yeast two-hybrid fusion proteins. The amino acid numbering for CG5 1373-07 

5 numbers the initial Methionine as amino acid #1. Table 23 shows all the interactions that 
were detected in the matrix 1x1 assay. The number of positive interactions detected and 
their detection in both orientations with respect to yeast two-hybrid fusion proteins 
confirms the discovery of a novel interaction between CG57008, HAVcr-1, variants, and 
CG51373, nephrinl variants, and specifically between the mucin domain of HAVcr-1 and 

1 0 the PKD domain of nephrin 1 . 

Example 9. Preparation of Antibodies that Bind CG57008 

Techniques for producing the antibodies are known in the art and are described, for 
example, in "Antibodies, a Laboratory Manual" Eds Harlow and Lane, Cold Spring 
Harbor publisher. Both rabbits and mice are suitable for the production of polyclonal 

15 antibodies, while mice are also suitable for the production of monoclonal antibodies. Mice 
in which the human immunoglobulin genes replace mouse immunoglobulin genes can be 
used to produce fully human monoclonal antibodies. These antibodies have better 
pharmaceutical characteristics, have little or no antibody-directed immune reactions that 
result in loss of therapeutic efficacy, and have been shown to eradicate tumors in animal 

20 model (Yang XD, et al. 9 Cancer Res. 59(6):1236^3 (1999)). 
Generation of human monoclonal antibodies 

Fully human IgG2 and IgG4 monoclonal antibodies (mAb), directed against 
CG57008-02 were generated from human antibody-producing XenoMouse strains 
engineered to be deficient in mouse antibody production and to contain the majority of the 
25 human antibody gene repertoire on megabase-sized fragments from the human heavy and 
kappa light chain loci as previously described in Yang et al, Cancer Res. 59(6): 1236-43 
(1999). The specificity of the antibodies was determined by ELISA. 

Generation of the antigen and antibodies: The extracellular domain of CG57008-02 
was subcloned to the baculovirus expression vector, pMelVSHis (CuraGen Corporation) 
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and expression studies were performed using the pBlueBac baculovirus expression system 
(Invitrogen Corporation) and confirmed by Western blot analyses. The sequence encodes 
the following polypeptide: 

SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHVTYRKOT 
5 ENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPKVTTTPI 

MSI PTTTTVPTTMTVSTTTSVPTTTSI PTTTSVPVTTTVSTFVPPMPLPRQNHEPVATSPSSPQPAETHPTT 
LQGAIRREPTSSPLYSYTTDGNDTVTESSDGLWNNNQTQLFLEHSLL (SEQ ID NO: 39). 

This polypeptide was used to generate antibodies. 



10 Epitope binning and BIAcore affinity determination 
Materials 

• MxhlgG - conjugated beads (Limit Exposure to Light) 

• Wash Buffer (PBS, Tween 20 {0.05%}) 

• Blocking Buffer 

1 5 • 96-well microtiter filter plate (Milipore #MADVN 6550) 

• mxhlgG-PE 

Procedure 

1 . Prepare beads for coupling to primary unknown antibody. (Protect from light). 
20 Use individual tubes for each unknown supernatant. Calculate the volume of supernatant 
needed by using the following formula: (n+10)x50jil (where n = total number of 
samples on plate). If the concentration is known, use at 0.5ug/ml. Gently vortex bead 
stock. Dilute beads in supernatant to a concentration of 2500 of each bead per well or 
0.5X10 5 /ml. 

25 2. Incubate on a shaker in the dark at room temperature overnight, or 2 hours if at a 

known concentration of 0.5ug/ral. 

3. Pre-wet filter plate by adding 200 jil wash buffer per well. Aspirate. 

4. Add 50ul of each bead to each well of filter plate. Wash once by adding 
lOOul/well wash buffer and aspirating. 

30 5. Add antigen and controls to filter plate 50^1/well. Cover and incubate in the dark 

for 1 hour on shaker. 
6. Wash 3 times. 
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7. Add secondary unknown antibody at 50ul/well. Use the same dilution (or 
concentration if known) as used for the primary antibody. Incubate in the dark for 2 hours 
at room temperature on shaker. 

8. Wash 3 times. 

5 9. Add 50ul/well biotinylated mxhlgG diluted 1 :500. Incubate in the dark for lhour 

on shaker at room temperature. 

10. Turn on both Luminex 100 and XYP base. Open the Luminex software and 

start the "warm up" operation. Make sure the sheath fluid container has enough volume 

and the waste container has enough space so it won't overfill. 
10 1 1 . In Luminex software, click the "new session" button. The "settings" window 

will pop up. All of the default settings are correct except for the "number of samples". 

12. Enter the number of samples in this field. 

13. Click on the "bead set" tab and enter the designation numbers of the beads used 
in the assay. 

15 14. Wash 3 times. 

15. Add 50ul/wellStreptavidin-PE at 1:1000. Incubate in the dark for 15min. on 
shaker at room temperature. 

1 6. Run two wash cycles on the LuminexlOO. 

17. Wash 3 times. 

20 1 8. Resuspend each well in 80ul blocking buffer. Carefully pipette up and down 

several times to resuspend beads. 

19. Place the plate in the Luminex base to be read. Make sure position Al is 

highlighted on your screen and click start. After die last sample has been read, perform 

two wash cycles and one soak cycle. Close the Luminex software. Then, turn off both the 
25 top and bottom of the Luminex 1 00. Release pressure in sheath fluid reservoir by 

loosening the cap. 
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BIAcore Determination 

BIAcore determinations were done using methods known in the art, for example, 
"Validation parameters for a novel biosensor assay which simultaneously measures serum 
5 concentrations of a humanized monoclonal antibody and detects induced antibodies", R. 
Wong, D. Mytych, S.Jacobs, R.Bordens, S.Swanson, Journal of Immunological Methods, 
209(1997)1-15. 

Table 24 shows that the monoclonal antibodies generated belong to eight distinct 

bins. 
10 Table 24 



Bins Antibody (affinity nM by BIAcore) 



15 


1 


Mab2.59 
Mabl2.9 


(0.38) 
(3.64) 




2 


Mab2.16 


(0.79) 


20 


3 
4 


Mab2.17 

Mabl.37 
Mab2.79 
Mab2.61 


(2.42) 

(2.78) 
(0.57) 
(1.0) 


25 


5 


Mab2.24 
Mab2.56 


(2.42) 
(1.11) 




6 


Mab2.70 


(2.71) 


30 


7 


Mab2.54 


(3.35) 




8 


Mab2.45 


(1.15) 



Example 10. Specificity of the monoclonal antibodies for CG57008-02 
35 ELISA Protocol 

Solution Preparation:_Coating Buffer (0.1M Carbonate, pH9.5), 8.4 g. NaHC03, 
3.56g. Na2C03, pH to 9.5, and dilute to 1 L. with ddH20 

Assay Diluent: Phaimingen #2641 IE 
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Protocol: 

I) Coat a 96-well high protein binding ELISA plate (Corning Costar #3590) with 
50 ul. of the CG57008-02 protein at a concentration of 5ug/mL. diluted in coating buffer, 
incubate overnight at 4 °C. 

5 2) Following day wash the wells 5X 200-300 ul of 0.5% Tween-20 in PBS. 

3) Block plates with 200ul of assay diluent for at least 1 hour at room temperature. 

4) Dilute CR014 antibodies in assay diluent with the final concentrations of 7, 15, 
31.3, 62.5, 125, 250, 500 and 1000 ng/ml. An anti-V5-HRP antibody was used at 1:1000 
to detect the V5 containing peptide as the positive control for the ELISA. 

10 5) Wash plate as in step 2). 

6) Add 50ul of each antibody dilution to the proper wells, incubate for at least 2 
hours at room temp. 

7) Wash plate as in step 2). 

8) Add 50ul of secondary antibody (goat anti-human-HRP at 1:1000 and incubate 
15 for 1 hour at room temp. 

9) Wash plate as in step 2). 

10) Develop assay with lOOul of TMB substrate solution/well. (1:1 ratio of 
solution A+B) (Pharmingen #2642KK) 

II) Stop reaction with 50ul. sulfuric acid. 12) Read plate at 450nm with a 
20 correction of 550nm. 

To demonstrate the specificity of the antibodies for CG57008-02, four of the 
antibodies, CR014.1.29, CR014.2.56.2, CR014.2.59.2, and CR014.2.45.1, as well as an 
isotype matched control mAb PK16.3, were tested by ELISA for reactivity against the 
CG57008-02 antigen. (Figure 2). The X axis depicts the antibody treatments in the order 
25 listed above and the Y axis is the optical density. 

In addition, to eliminate the possibility that the observed immunoreactivity was 
directed against the V5-His tag added to the CG57008-02 construct, an unrelated protein 
containing the same V5-His tag was included in the assay as a control. (Figure 3). These 
results demonstrate that the four antibodies assayed specifically bind to the wild type 
30 CG57008-02 antigen. 
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Example 11. Immunohistoc hemic al (EHC) analysis of CG57008 expression in normal 
and tumor tissues 

Immunohistochemical (IHC) analysis of CG57008 expression in normal and tumor 
tissue specimens was performed using techniques known in the art. 
5 Immunohistochemistry staining Protocol 

Sample Preparation 

1 . Fix the tissue in 10% formalin at 4°C overnight. 

2. Paraffin embed the fixed tissue. 

3. Mount tissue sections on slides. 

10 4. Rinse the slides twice for 2 minutes in 100% alcohols (18:1:1 100% ethanol:100% 
methanol: 100% isopropanol) and twice for 2 minutes in a 95% solution of the 
100% alcohols. 

5. Place slides in an 80% solution of the 100% alcohols for 2 minutes and several 
times with fresh deionized water. 

1 5 SDS Antigen Retrieval 

1 . Place slides face-up in incubation tray and cover each section with 1% SDS in TBS 
(lOOmM Tris pH 7.4, 138mM NaCl, 27mM KC1). 

2. Incubate for five minutes at room temperature, followed by three five minute 
washes with TBS. 

20 Blocking 

1 . Immerse slides in a dish containing blocking buffer (serum from host species of 
secondary antibody to be used, diluted 1 : 1 0 in TBS). 

2. Incubate at 37°C for one hour. 

Incubation with Primary Antibodies 

25 1 . Cover the tissue sections with primary CG57008 (CR014) antibodies diluted in 

blocking buffer. 
2. Incubate for 2 hours at 37°C. 
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3. Rinse twice in TBS for five minutes each wash. 
Incubation with Secondary Antibodies 

1 . Cover the tissue sections with conjugated secondary antibody diluted in blocking 
buffer according to manufacturer's instructions. 

2. Incubate at 37°C for one hour. 

3. Rinse twice in TBS for five minutes each wash. 

4. Counterstaining and visualization. 



Table 26 summarizes the tissues that highly express CG57008-02 as detected using 
10 CG57008 2.59.2 mAb. The specimens are graded on a scale of 0-3, with a score of 1+ 

indicating that the staining is above that observed in control tissues stained with an isotype 
control irrelevant antibody. The corresponding histological specimens from one renal 
tumor and the pancreatic tumor are shown in Figures 4A and 4B. Table 25 shows all the 
tissues tested and the number of each type of tissue tested and results. In addition to these 
1 5 the renal and pancreatic tumors, specimens from head and neck cancer, ovarian cancer, 
gastric cancer, melanoma, lymphoma, prostate cancer, liver cancer, breast cancer, lung 
cancer, bladder cancer, colon cancer, esophageal cancer, and brain cancer, as well the 
corresponding normal tissues were stained with 2.59.2 mAb. 

Overall, approximately 18% of the renal cancer tissue samples and 6% of the 
20 pancreatic cancer tissue samples examined were highly positive when stained with the 
2.59.2 mAb recognizing the CG57008 antigen. No staining in normal tissues was seen. 
These results indicate that CG57008-02 is a marker of cancer in these tissues and that the 
2.59.2 antibody can be used to differentiate cancers from normal tissues. 

Table 25. 

25 



Array # 


Position Tissue 


Staining Intensity 


ITTA03354B 


A3 HEAD & NECK CA 


0 


ITTA03354B 


A7 HEAD & NECK CA 


0 


ITTA03354B 


A8 HEAD & NECK CA 


0 


ITTA03354B 


A9 HEAD & NECK CA 


0 
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ITTA03354B 


A10 


HEAD & NECK CA 


0 


ITTA03354B 


B3 


HEAD & NECK CA 


0 


ITTA03354B 


B4 


HEAD & NECK CA 


0 


ITTA03354B 


B5 


HEAD & NECK CA 


0 


ITTA03354B 


B6 


HEAD & NECK CA 


0 


ITTA03354B 


B8 


HEAD & NECK CA 


0 


ITTA03354B 


B9 


HEAD & NECK CA 


0 


ITTA03354B 


C1 


HEAD & NECK CA 


0 


ITTA03354B 


C2 


HEAD & NECK CA 


0 


ITTA03354B 


C3 


HEAD & NECK CA 


0 


ITTA03354B 


C5 


HEAD & NECK CA 


0 


ITTA03354B 


C6 


HEAD & NECK CA 


0 


ITTA03354B 


C7 


HEAD & NECK CA 


0 


ITTA03354B 


C8 


HEAD & NECK CA 


0 


ITTA03354B 


C9 


OVARIAN CA 


0 


ITTA03354B 


C10 


OVARIAN CA 


0 


ITTA03354B 


D1 


OVARIAN CA 


0 


ITTA03354B 


D2 


OVARIAN CA 


0 


ITTA03354B 


D3 


OVARIAN CA 


0 


ITTA03354B 


D4 


OVARIAN CA 


0 


ITTA03354B 


D5 


OVARIAN CA 


0 


ITTA03354B 


D6 


OVARIAN CA 


0 


ITTA03354B 


D7 


OVARIAN CA 


0 


ITTA03354B 


D8 


OVARIAN CA 


0 


ITTA03354B 


D9 


OVARIAN CA 


0 


ITTA03354B 


E3 


OVARIAN CA 


0 


ITTA03354B 


E4 


OVARIAN CA 


0 


ITTA03354B 


E5 


OVARIAN CA 


0 


ITTA03354B 


E6 


OVARIAN CA 


0 


ITTA03354B 


E7 


OVARIAN CA 


0 


ITTA03354B 


E9 


OVARIAN CA 


0 


ITTA03354B 


E10 


OVARIAN CA 


0 


ITTA03354B 


F1 


OVARIAN CA 


0 


ITTA03354B 


F2 


OVARIAN CA 


0 


ITTA03354B 


F3 


OVARIAN CA 


0 


ITTA03354B 


F4 


PANCREATIC CA 


0 


ITTA03354B 


F5 


PANCREATIC CA 


0 


ITTA03354B 


F6 


PANCREATIC CA 


0 


ITTA03354B 


F7 


PANCREATIC CA 


0 


ITTA03354B 


F9 


PANCREATIC CA 


0 


ITTA03354B 


F10 


PANCREATIC CA 


0 


ITTA03354B 


G1 


PANCREATIC CA 


0 


ITTA03354B 


G2 


PANCREATIC CA 


0 
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ITTA03354B 


G3 


PANCREATIC CA 


0 


ITTA03354B 


H4 


RENAL TUMOR 


0 


ITTA03354B 


H5 


RENAL TUMOR 




ITTA03354B 


H6 


RENAL TUMOR 


0 


ITTA03354B 


H7 


RENAL TUMOR 


0 


ITTA03354B 


H8 


RENAL TUMOR 


0 


ITTA03354B 


H9 


RENAL TUMOR 


0 


ITTA03354B 


H10 


RENAL TUMOR 




ITTA03354B 


11 


RENAL TUMOR 


0 


ITTA03354B 


12 


RENAL TUMOR 


0 


ITTA03354B 


13 


RENAL TUMOR 


0 


ITTA03354B 


14 


RENAL TUMOR 


0 


ITTA03354B 


15 


RENAL TUMOR 


0 


ITTA03354B 


17 


RENAL TUMOR 


0 


ITTA03354B 


18 


RENAL TUMOR 




ITTA03354B 


19 


RENAL TUMOR 




ITTA03354B 


no 


RENAL TUMOR 




ITTA03354B 


ji 


RENAL TUMOR 


0 


ITTA03354B 


J2 


RENAL TUMOR 


0 


ITTA03354B 


J3 


RENAL TUMOR 


0 


ITTA03354B 


J4 


RENAL TUMOR 


0 


ITTA03354B 


J6 


RENAL TUMOR 




ITTA03354B 


J7 


RENAL TUMOR 


0 


ITTA03356C 


C2 


GASTRIC CA 


0 


ITTA03356C 


C4 


GASTRIC CA 


0 


ITTA03356C 


C6 


GASTRIC CA 


0 


ITTA03356C 


D3 


GASTRIC CA 


0 


ITTA03356C 


D7 


GASTRIC CA 


0 


ITTA03356C 


D8 


GASTRIC CA 


0 


ITTA03356C 


D10 


GASTRIC CA 


0 


ITTA03356C 


E6 


MELANOMA 


0 


ITTA03693A 


A5 


LYMPHOMA 


0 


ITTA03693A 


A6 


LYMPHOMA 


0 


ITTA03693A 


C6 


PROSTATE CA 


0 


ITTA03693A 


D5 


LIVER CA 


0 


ITTA03693A 


E4 


LIVER CA 


0 


ITTA03693A 


E5 


LIVER CA 


0 


ITTA03693A 


F6 


LIVER CA 


0 


ITTA04331A 


A4 


BREAST CA 


0 


ITTA04331A 


A5 


BREAST CA 


0 


ITTA04331A 


A6 


BREAST CA 


0 


ITTA04331A 


B3 


BREAST CA 


0 


ITTA04331A 


B4 


BREAST CA 


0 
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Table 26. Summay of expression of CG57008 protein detected by 2.59.2 mAb in renal 
and. pancreatic tumors 



Tissue source 
RENAL CA 
RENAL CA 
RENAL C A 
RENAL CA 
PANCREATIC CA 



Test Article ScorelM=meinbraiie« C=cvtosolicl 
ABX-14 (2.59.2) 1+M 
ABX-14 (2.59.2) 2+M.C 
ABX-14 (2.59.2) 1+M,C 
ABX-14 (2,59.2) 2+M,C 
ABX-14 (2.59.2) 2+M,C 
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Example 12. Quantification of membrane bound CG57008 protein by Flow 
Cytometry 

FACS analysis was performed to demonstrate the specificity of the anti-CG57008 
5 antibodies for cell membrane-bound CG57008 and to identify preferred antibodies for use 
as a therapeutic or diagnostic agent The analysis was performed on four renal cancer cell 
lines, ACHN (ATCC#:CRL-161 1), CAKI-1 (ATCC#:HTB-46), CAKI-2 (ATCC#:HTB- 
47) and 786-0 (ATCC#:CRL-1932). A breast cancer cell line, BT549, that does not 
express CG57008 was used as a control. Table 27 shows that both antibodies, 2.59.2 and 
10 2.70.2, specifically bind to CG57008 expressed on ACHN, CAKI-1, CAKI-2 and 786-0 
cells, but not BT549 cells. Based on the Geo Mean Ratios normalized to the pK16 isotype 
control, irrelevant antibody, ACHN cells have a higher cell surface expression of 
CG57008 protein. 

15 Table 27 

Geo Mean Ratio (relative to pK16) 
Antibody ACHN CAKI-1 786-Q CAKI-2 BT549 



20 



2.59.2 27.8 14.8 22.8 13.8 1.4 

2.70.2 29.7 15.8 23.4 13.8 1.8 



Example 13. Antibody mediated toxin killing 

25 Kohls and Lappi, Biotechniques. 28 (1): 1 62-5 (2000) have described a clonogenic 

assay to determine if a primary antibody can induce cancer cell death when used in 
combination with a saporin toxin conjugated secondary antibody reagent 

A. Clonogenic Assay Protocol: Anti-CG57008 mAb-mediated toxin killing 

ACHN and BT549 cells were plated onto flat bottom tissue culture plates at a 
30 density of 3000 cells per well. On day 2 or once cells reach -25% confluency, 100 
ng/well secondary mAb- toxin (goat anti-human IgG-saporin; Advanced Targeting 
Systems; HUM-ZAP; cat. # IT-22) was added. EGFR, CR014.2.7.2, CR014.2.59.2, or 
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isotype control mAb was then added to each well at the desired concentration (typically 1 
to 500 ng/ml). On day 5, the cells were trypsinized, transferred to a 150 mm tissue culture 
dish, and incubated at 37 °C. Plates were examined daily. On days 10-12, all plates were 
Giemsa stained and colonies on the plates were counted. Plating efficiency was 
5 determined by counting the cells prior to transfer to 150 mm plates and compared to the 
number of colonies that eventually formed. 

The percent viability in antigen positive ACHN and antigen negative BT549 cell 
lines are presented in Figure 5 and Figure 6, respectively. In this study, the cytotoxic 
chemotherapy reagent 5-FU was used as the positive control and induced almost complete 

10 killing, whereas addition of the saporin conjugated-goat anti-human secondary antibody 
alone had no effect. A monoclonal antibody (NeoMarkers MS-269-PABX) generated 
against the EGF receptor, which is expressed by both cell lines, was used to demonstrate 
primary antibody and secondary antibody-saporin conjugate specific killing. The results 
indicate that both cell lines were susceptible to EGFR mAb mediated toxin killing at 100 

1 5 ng/ml. At the same dose, both the 2.59.2mAb and the 2.70.2 mAb induced over 90% 
ACHN cell death as compared to 0% BT549 cell death. 

B. Clonogenic Assay Protocol: Auristatin E conjugated antibody mediated toxin 
killing 

CAKI-1 and BT549 cells were plated onto flat bottom tissue culture plates at a 

20 density of 3000 cells per well. On day 2 or cells reach ~25% confluency, various 

concentrations (typically 1 to 1000 ng/ml) of unconjugated and Auristatin E-conjugated 

mAb, which included EGFR, CR014.2.7.2, CR014.2.59.2 or isotype control mAb 

(CR01 1 .2.6.2), were added to cells. On day 5, the cells were trypsinized, transferred to a 

150 mm tissue culture dish, and incubated at 37 °C. Plates were examined daily. On days 

25 10-12, all plates were Giemsa stained and colonies on the plates were counted. Plating 

efficiency was determined by counting the cells prior to transfer to 150 mm plates and 

compared to the number of colonies that eventually formed. 

The percent viability in antigen positive CAKI-1 and antigen negative BT549 cell 

lines are presented in (Figure 17 and Figure 18), respectively. CR01 1.2.6.2 was used as 

30 the isotype control. A monoclonal antibody (NeoMarkers MS-269-PABX) generated 

against the EGF receptor, which is expressed by both cell lines, was used to demonstrate 

specific killing mediated by AE-conjugated antibody. 
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The results indicate that unconjugated and AE-conjugated CR01 1 .2.6.2 mAb had 
no effect on growth of both CAKI-1 and BT549 cells. However, both cell lines were 
susceptible to AE-EGFR mAb mediated toxin killing in a dose-dependent fashion. At the 
maximum dose, both CG57008 mAbs (CRO 14.2.59.2 and 2.70.2) induced over 90 % 
5 CAKI-1 cell death when compared to their unconjugated counterparts. The response was 
dose dependent. At the same dose range, both CR014.2.59.2 and 2.70.2 mAbs did not 
affect the survival of BT549 cells. 

Example 14. Preparation and testing of chemotherapy and radio-immunoconjugated 
10 antibodies 

Cytotoxic chemotherapy or radiotherapy of cancer is limited by serious, sometimes 
life-threatening, side effects that arise from toxicities to sensitive normal cells because the 
therapies are not selective for malignant cells. Therefore, there is a need to improve the 
selectivity. One strategy is to couple therapeutics to antibodies that recognize tumor- 

1 5 associated antigens. This increases the exposure of the malignant cells to the ligand- 
targeted therapeutics but reduces the exposure of normal cells to the same agent, 
(reviewed in Allen, Nat Rev Cancer. 2(10):750-63 (2002)). 

CG57008 is one of these tumor-associated antigens, as shown by its specific 
expression on cellular membranes of tumor cells by FACS and IHC. Therefore one 

20 embodiment of the invention uses monoclonal antibodies directed against CG57008 
coupled to cytotoxic chemotherapic agents or radiotherapic agents as anti-tumor 
therapeutics. 

Depending on the intended use of the antibody, j.e., as a diagnostic or therapeutic 
reagent, radiolabels are known in the art and have been used for similar purposes. For 

25 instance, radionuclides which have been used in clinical diagnosis include I 131 , 1 125 , 1 123 , 
Tc", Ga 67 , as well as In 111 . Antibodies have also been labeled with a variety of 
radionuclides for potential use in targeted immunotherapy (Peirersz et al. (1987). The use 
of monoclonal antibody conjugates for the diagnosis and treatment of cancer. Immunol. 
Cell Biol65: 1 1 1-125). These radionuclides include Re 188 and Re 186 as well as Y 90 , and to 

30 a lesser extent Au 199 and Cu 67 . I 131 has also been used for therapeutic purposes. U.S. Pat 
No. 5,460,785 provides a listing of such radioisotopes. 
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Radiotherapeutic chelators and chelator conjugates are known in the art. For 
instance, U.S. Pat. No. 4,831,175 is directed to polysubstituted 

diethylenetriaminepentaacetic acid chelates and protein conjugates containing the same, 
and methods for their preparation. U.S. Pat Nos. 5,099,069; 5,246,692; 5,286,850; and 
5 5,124,471 also relate to polysubstituted DTPA chelates. 

Cytotoxic chemotherapies are known in the art and have been used for similar 
purposes. For instance, U.S. Pat. No 6,441,163 describes processes for the production of 
cytotoxic conjugates of maytansinoids and antibodies. The anti-tumor activity of a new 
tubulin polymerization inhibitor, auristatin PE, is also know in the art (Mohammad et al, 
10 IntJOncol. 15(2):367-72 (1999). 

Once these chemotherapy or radiolabel and antibody conjugates are made, they can 
be tested for their cytotoxic activity on CG57008-02 expressing cells. Such experiments 
use methods known in the art, such as MTS, cell counting and clonogenic assays. 

1 5 Example 15* FACS analysis of expression of CG57008-02 on CD4+ T cells 
Method 

Mononuclear cells were isolated from human blood diluted 1:1 in PBS, by 
20 spinning over Ficoll for 20 min. The mononuclear cells were washed twice at 1000 rpm 
with PBS -Mg and Ca and re-suspended in Miltenyi buffer, PBS, 0.5% BSA, 5 mM 
EDTA at approximately 10 8 cells/ml. 20 \xL of CD4 Miltenyi beads were added per 10 7 
cells and incubated for 15 min on ice. Cells were washed with a 10-fold excess volume of 
Miltenyi buffer. VS positive selection column was washed with 3 mL of Miltenyi buffer. 
25 The pelleted cells were re-suspended at 1 0 8 cells per mL of Miltenyi buffer and applied to 
the washed VS column. The column was then washed 3 times with 3 mL of Miltenyi 
buffer. Following this, the VS column was removed from the magnetic field and CD4+ 
cells were eluted from the column with 5 mL of Miltenyi buffer. Isolated CD4+ 
lymphocytes were pelleted and re-suspended in DMEM 5% FCS plus additives (non 
30 essential amino acids, sodium pyruvate, mercaptoethanol, glutamine, penicillin, and 
streptomycin) at 10 6 cells/mL. lxlO 6 freshly islolated resting CD4+ T cells were 
transferred into flow cytometry tubes and washed with 2 ml/tube FACS staining buffer 
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(FSB) containing PBS, 1% BSA and 0.05% NaN3. Cells were spun down and supernatant 
removed. Cells were blocked with 20% goat serum in FSB for 30 minutes on ice. Cells 
were washed as above and incubated with 10 jig/ml of primary human and CG57008-02 
mAb or control PK16.3 mAb in FSB (200 for 45 min on ice followed by washing. 

5 Secondary goat anti-human PE conjugated antibody was added at 1 :50 dilution for 45 
minutes on ice in the dark, washed, resuspended in 500 |il of PBS containing 1% 
formaldehyde and kept at 4°C until flow cytometry analysis was performed. 

FACS analysis was performed to determine the expression of CG57008-2 protein 
as detected with five anti-CG57008-02 antibodies. (2.59.2, 1.29, 2.70.2, 2.56.2, 2.45.1) on 

10 human and mouse resting CD4+ T cells, as well as human activated and human polarized 
CD4+ T cells. These analyses demonstrated that freshly isolated resting human CD4+ T 
cells do not express CG57008-02, while a major fraction of polarized human Th2 and Thl 
cells do express CG57008-02. (See Table 28). Table 29 demonstrates that over the course 
of 5 days, continual stimulation of T cells results in an increase in CG57008 expression, as 

1 5 measured by 2.70.2 antibody, as compared to the control PK16.3 antibody. Furthermore, 
addition of matrix metalloproteinase inhibitor (MMPI) did not measurably increase 
CG57008 expression, demonstrating that the receptor is not shed from T cells under these 
experimental conditions. Thus, expression of the CG57008 protein and specific antibody 
binding is specific to activated Thl and Th2 cells, which in turn, are characteristic of 

20 inflammatory response, specifically asthma. 
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Table 28. FACS Analysis of the Expression of the CG57008-O2 protein on human CD4+ 
Th2 Cells using five anti-cG57008-02 antibodies. The experiment is described in the left 
hand column and the labeled antibody is specified along the top row. Data is reported as 
5 the geometric mean of the fluorescence intensity. 



Experiment 


Control 
PK16.3 


CR014 
1.29 


CR014 
2.45.1 


CR014 
2.56.2 


CR014 
2.59.2 


CR014 
2.70.2 


Resting Human 
CD4+ T cells 


4.6 


4.7 


5.1 


6 


4.9 


N/A 


Polarized Human 
CD4+ Th2 Celts 


8.4 


22.3 


42.4 


564.1 


22 


27.8 



Table 29. Percent of activated T cells that express CG57008 





Day 0 


Day 1 


Day2 


Day4 


DayS 


Control 
PK16.3 


-MMPI 


1 


3 


3 


1 


1 i 


+ MMPI 


1 


2 


6 


2 


2 


CR014 


-MMPI 


1 


8 


10 


5 


13 


2.70.2 


+ MMPI 


1 


10 


14 


10 


19 



10 

Example 16. Cytokine assays 

IL-4, IL-5, IL-10, IL-13, and IFNy production levels by activated Thl and 
Th2 cell were measured in culture supernatants treated with anti-CG57008-01 antibodies 
using standard EUSA protocols. Cytokine production by Thl or Th2 cells treated with 
1 5 anti-CG57008-02 antibodies was compared to Thl or Th2 cells treated with the control 
PK16.3 antibody. In addition, the following samples were run in parallel as internal 
controls: j) anti-CD3 treated Thl or Th2 cells, where no cytokine production is expected 
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because of the absence of co-stimulation, u) anti-CD3/anti-CD28 stimulated Thl or Th2 
cells, expected to show detectable cytokine production, and Hi) untreated Thl or Th2 cells. 

CD4+ T cells were isolated as described in Example 15. Isolated CD4+ 
lymphocytes were then spun down and re-suspended in DMEM 5% FCS plus additives 

5 (non essential amino acids, sodium pyruvate, mercaptoethanol, glutamine, penicillin, and 
streptomycin) at 10 6 cells/mL. Falcon 6-well non-tissue culture treated plates were pre- 
coated overnight with anti-CD3 (2 ng/ml) and anti-CD28 (10 ng/ml) (600ul total in 
Dulbecco's PBS) overnight at 4°C. The plates were washed with PBS and CD4+ 
lymphocytes were suspended at 500,000 cells/ml in Th2 medium: DMEM+ 10% FCS plus 

10 supplements and IL-2 5ng/ml, IL-4 5 ng/ml, anti-DFN gamma 5 ng/ml and cells were 
stimulated 4-6 days 37 °C temp and 5 % C02 in the presence of 5 ng/ml of Mab 
recognizing CG57008 or isotype matched negative control mAb PK16.3. 

In another set of experiments, CD4+ lymphocytes were suspended at 500,000 
cells/ml in Thl medium: DMEM+ 10% FCS plus supplements and IL-2 5ng/ml, IL12 5 

15 ng/ml, anti-IL-4 S^g/ml and stimulated 4-6 days 37 °C temp and 5 % C02 in the presence 
of 5 ^g/ml CR014 or isotype matched control mAb PK16.3. Cells were washed 2x in 
DMEM and resuspended in DMEM, 10% FCS plus supplements and 2 ng/ml IL-2 
(500,000 cells/ml) in the presence of 5 ng/ml CR014 or control PK16.3 mAb and cultured 
(rested) for 4-6 days 37 °C temp and 5 % C02. The process of activation and resting was 

20 repeated at least once more as described above with the addition of anti-CD95L to prevent 
apoptosis of cells through FAS. Falcon 96-well non-tissue culture treated plates pre- 
coated overnight with anti-CD3 mAb at 500 ng/mL and costimulatory molecule CG55790 
(B7H2, 5ng/ml) were washed and 100 \sl of CR014 treated Thl or Th2 (200,000 cells) 
added per well. After 3 days of culture, the supernatants were removed and IL-4, IL-5, IL- 

25 10, IL-13, and IFNy levels were determined by ELISA (Pharmingen, San Diego, CA or 
R&D Systems, Minneapolis, MN). 

As demonstrated below, anti-CG57008-02 significantly inhibited release of the 
tested cytokines by Thl and Th2 cells (see Figures 7 - 16). Results where inhibition of 
cytokine production is significant (p= 02-.008), are marked on the bar chart with an 
30 asterisk. 
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Table 30 and Table 31 summarize the bar-graphs shown in Figure 7 through Figure 
16. A summary of Th2 cytokine inhibition data obtained from multiple experiments with 
different donors is provided in Table 32. Each experiment used purified CD4+ cells 
isolated from whole blood samples from two independent donors. Cytokine production is 
5 reported as the percent of cytokine production detected using the control PK16.3 mAb. 
The mAb used in each experiment is specified along the bottom row. Results that report 
significant cytokine inhibition are underlined in Table 32 below, a "ND" indicates that the 
experiment was not performed. These results do reflect donor dependent variability but 
show that mAbs 2.59.2 and 1 .29 reproducibly block one or more of the Th2 cytokines. 

10 

Table 30. Cytokine Inhibition in CD4+ Thl cells using anti-CG57008 antibodies in two 
independent human donors. Experiments that demonstrate significant inhibition of 
cytokine production are marked with an asterisk: *P= 0.01 to 0.05; **P=O.005 to 0.009; 
***P=0.001 to 0.004 

15 



Donor 12+17 


PERCENTAGE OF CONTROL ANTTBODY 


TH1 


Cytokines 
mAbs 


IL-5 


IL-4 


IL-10 


IL-13 


EST y 


2.56.2 


100.17 


28.49 * 


63.76 * 


86.45 


93.69 


2.45.1 


90.23 


39.78 * 


83.98 


96.25 


100.6 


1.29 


94.63 


81.05 


60.77 ** 


73.95 *** 


93.51 


2.59.2 


66.62 * 


31.40* 


68.99 * 


54.5 *** 


128.12 
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Table 31. Cytokine Inhibition in CD4+ Th2 cells using anti-CG57008 antibodies in two 
independent human donors. Experiments that demonstrate significant inhibition of 
cytokine production are marked with an asterisk: *P= 0.01 to 0.05; **P=0.005 to 0.009; 
5 ***P=0.001 to 0.004 



Donor 12+17 


PERCENTAGE OF CONTROL ANTIBODY 


TH2 


Cytokines 

CR014^\^ 
mAbs ^\ 


EL-5 


IL-4 


IL-10 


IL-13 


INF y 


2.56.2 


112.07 


103.46 


93.97 


86.45 


88.30 


2.45.1 


148.7 


25.66 **• 


55.97 * 


86.81 


25.66 * 


1.29 


80.26 


112.54 


44.45* 


48.91 ** 


112.54 


2.59.2 


23.62 * 


19.17** 


43.86 * 


43.71 *** 


19.18* 



Table 32. Summary of Cytokine Inhibition using mAbs 2.59.2 and 1.29 in 5 independent 
10 human donor groups. Results of experiments that report inhibition greater than 50% of that 
seen using the control PK16.3 antibody are underlined. 



^BtmorlD 
Cytokine 


12+17 


12+14 


13+14 


14 


12 


IL-4 


12 


626 


130 


ND 


ND 


IL-5 


24 


s 


122 


67 


2 


IL-10 


44 


83 


19 


45 


109 


IL-13 


44 


ND 


12 


100 


91 



2.59.2 mAb 1.29 mAb 
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Example 17. CR014 Animal Experiments 
For Inflammation: 

FITC Contact Sensitivity: Th2 model using fluorescein isothyocyanate (FITC) as 
a contact sensitizer 

5 Animals (eg. mice, rats, monkeys, humans) can develop what is called a "contact 

sensitivity" (CS) response to topical exposure with an antigen toward which they have 
prior immunity. There are two basic types of CS, one mediated by CD8 T cells (TNCB, 
DNFB contact sensitivity, poison ivy exposure), and another that triggers IL-4-secreting 
CD4 T cells (Th2) to recruit eosinphils to the site of antigen exposure. The response 

10 requires a T cell/antigen presenting cell interaction, production of "Th2" cytokines such as 
IL-4, and recruitment of eosinophils. 

Mouse strains: Any mouse strain may be used, but this assay works best in 
BALB/c mice, males or females, from 6-8 weeks of age (between 20 and 35 grams body 
weight). 

1 5 Skin sensitization: For skin sensitization, die abdomens of the mice are shaved to 

expose the skin and then painted with 10 ul of contact sensitizers (FITC in acetone & 
dibutyl pthalate). 

Ear measurement: Five to six days later, mice will be anesthetized with an 
intraperitoneal injection of pentabarbitol sodium in water (60 mg/kg body weight). A 

20 relatively long-lasting anesthetic is required for the time needed to measure ear-thickness 
and "paint" the ears with the contact sensitizer although the experienced person can 
usually work with isoflurane. The "pre-challenge" ear thickness measurements are made 
using a dial thickness gauge (e.g. an engineer's micrometer) measuring in units of 10 A 
inches. Three quick readings are made per ear (taking care to move stray hairs out of the 

25 way), and an average is taken. 

Antigen challenge: While mice remain anesthetized, the dorsal surfaces of the 
ears of immunized and control mice are painted with FITC (0.5%) in the appropriate 
diluents. After 24 hours, the "post-challenge" ear thickness measurements are made. The 
24 h change in ear thickness is proportional to the antigen-specific inflammatory response. 
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Immediately following the ear measurements, mice are sacrificed and analysis performed 
as described above. 

Th2/eosinophilia measurement: 24 hours post-challenge, the mice are 
anesthetized with isoflurane and "post-challenge" ear thickness measurements are made as 

5 above. The 24 h change in ear thickness is proportional to the magnitude of the antigen- 
specific inflammatory response. Immediately following the ear measurements, mice are 
sacrificed by C02 inhalation and their ears collected for histologic analysis. 

Mouse ovalbumin (OVA) asthma: Th2 model using ovalbumin to induce 
eosinophil influx in the lungs: Animals (e.g. mice, rats, monkeys, humans) can develop 

10 asthma in response to repeated antigen exposure to the lungs. Ovalbumin has commonly 
been used to induce eosinophil influx and airway hyper-reactivity in mice, The response 
requires a T cell/antigen presenting cell interaction, production of "Th2" cytokines such as 
IL-4, IgE production, recruitment of eosinophils, and mast cell degranulation. 

Mouse strains: Several mouse strains have been used. Some strains show better 

15 airway responses, others show better IgE, and others show clearer cytokine responses in 
the broncho-alveolar lavage (B AL) or serum. 

Asthma induction: Mice are immunized on day 0 and day 5 with OVA i.p. (8 ug) 
in alum (2 mg). Animals receive two, lh aerosol challenges with OVA (0.5% in PBS) on 
day 12, using an aerosol chamber. Control animals receive PBS aerosol. Half of the 

20 animals are tested on day 14 for effects on pulmonary function (airway hyperreactivity), 
and the other half are examined for cell types and numbers in their BAL on day 1 5. 

Pulmonary function assessment: Pulmonary conductance (GL) and pulmonary 
dynamic compliance (Cdyn) changes are measured in response to methacholine, 
administered via a jugular venous catheter to anesthetized and ventilated mice in a whole- 

25 body plethysmograph. The output signals from 8-10 consecutive breaths are analyzed 

using a computerized cross-correlation method. The GL and Cdyn peak responses to each 
dose are expressed as a percentage of the baseline values before that dose. 

BAL assessment: Animals are injected intraperitoneally with a tribromoethanol 
solution for deep anaesthesia and sacrificed. After thoracotomy, the trachea of each animal 

30 is cannulated and bronchoalveolar lavage (BAL) are performed twice with 0.5 ml PBS 

buffer 12. The collected volume is centrifuged and the supernatant is frozen at -70°C. The 

cellular content is resuspended in PBS and the cells are counted by quantification of an 
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aliquot using a haemocytometer. The leukocyte differential is determined from Diff-Quik 
(Baxter)-stained cytocentrifuged BAL fluid, and the absolute eosinophil count derived as 
the product of the leukocyte count and the eosinophil fraction. 

Other assessments: Other parameters that can be examined in this model are 1) 
5 total serum or BAL IgE, 2) OVA-specific serum or BAL IgE, and 3) serum or BAL 
cytokines. 

Mouse ear-swelling DTH : Thl models with antigens such as hen egg lysozyme 
(HEL), ovalbumin (OVA), or chicken gammaglobulin 

Thl M tuberculin-type" DTH: Animals (e.g. mice, rats, monkeys, humans) will 

10 develop what is called "delayed type hypersensitivity" (DTH) responses in response to 
intradermal or subcutaneous exposure to an antigen toward which they have prior 
immunity. There are two basic types of DTH, one being mediated by CD8 T cells (contact 
sensitivity, poison ivy exposure), and the other driven by IFN-gamma-secreting CD4 T 
cells (Thl) that recruit macrophages to the site of antigen exposure. The latter has been 

1 5 termed a tuberculin-type DTH, and shows a characteristic delayed inflammation occurring 
at 24-48 hours. The response requires a T cell/antigen presenting cell interaction, 
production of "Thl" cytokines such as IFN-gamma, and chemokines responsible for 
recruiting macrophages. 

Mouse strains: Any mouse strain may be used, but the most work has been done 

20 using BALB/c and C57BL/6 mice, males or females, from 6-8 weeks of age (between 20 
and 35 grams body weight). 

Antigen Immunization: Mice are immunized with HEL, OVA, or chicken 
gammaglobulin (100 ug/mouse) in 100 ul CFA administered subcutaneously at the base of 
the tail. 

25 Ear measurement and challenge: Seven to ten days later, the mice are 

anesthetized with an intraperitoneal injection of pentabarbitol sodium in water (60 mg/kg 
body weight). A relatively long-lasting anesthetic is required for the time needed to 
measure ear-thickness and inject the ears, although the experienced person can usually 
work with isoflurane. The "pre-challenge" ear thickness measurements are made using a 

30 dial thickness gauge (e.g. engineer's micrometer) measuring in units of 10" 4 inches. Three 

quick readings are made per ear (taking care to move stray hairs out of the way), and an 

average is taken. While mice remain anesthetized, the dorsal surfaces of the ears of 

205 



WO 03/080856 



PCT/US03/08490 



immunized and control mice are injected subcutaneously with antigen (40 ug of HEL, or 
10 ug OVA, or 10 ug chicken gammaglobulin in 10 ul of PBS), using a tuberculin syringe 
and a 30g needle. 

DTH measurement: 24 hours post-challenge, the mice are anesthetized with 
5 isoflurane and "post-challenge" ear thickness measurements are made as above. The 24 h 
change in ear thickness is proportional to the magnitude of the antigen-specific 
inflammatory response. Immediately following the ear measurements, mice are sacrificed 
by C02 inhalation and their ears collected for histologic analysis. 



10 For Kidney Cancer: Efficacy Evaluation of CR014 Auristatin E- Conjugated 

Antibody Against the 786-0 Human Renal Cell Carcinoma Xenograft in Nude Mice 

The objective of this study is to evaluate the antitumor efficacy of CR014 against a 
human renal cell carcinoma (786-0) grown as a xenograft in athymic (nude) mice. 

Test System 



Species/strain: 

Physiological state: 

Age/weight range at start of study: 

Animal supplier: 
Number/sex of animals: 
Identification: 
Randomization: 

Justification: 



Replacement: 

1 S Animal Housing and Environment 

Housing: 
Acclimation: 

Environmental conditions: 
Food/water and contaminants: 



Mouse/ CD-I nu/nu athymic 
Normal 

Animals aged 5 to 6 weeks with body weight of 
approximately 20 g 

Charles River 

TBD/Female 

Individually tattooed tails. 

Animals will be randomized after tumor 

implantation, but before treatment commences. 

Human xenograft models represent a well- 
characterized system for testing of anti-tumor 
agents. 

Animals will not be replaced during the course of 
the study. 



Static microisolators. 
1 week. 

12-hour light cycle at 21 - 22° C (70 - 72 °F) and 
40% - 60% humidity. 

Irradiated standard rodent diet (NIH31 Modified 
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Test Article 

Identity and lot number: 
Physical description: 
Source: 

Characterization/certification: 
Storage conditions: 

Stability/expiration date: 
Hazards/precautions: 

Retention of reserve samples: 
Disposition of unused test article: 

Vehicle 

Identity and lot number: 
Source: 

Storage conditions: 
Stability/expiration date: 

Test Article/Vehicle Mixture 

Dosage form: 

Dosage preparation/storage: 

Frequency of preparation: 

Stability/expiration date: 

Storage and analysis of dosing mixtures 
for concentration: 

Disposition of unused dosing mixture: 

Administration of Test Article 

Route and method of administration: 
Justification for route of administration: 
Frequency and duration of dosing: 
Administered doses: 
Administered volume(s): 



and Irradiated) consisting of: 18% protein; 5% fat; 
and 5% fiber; water (reverse osmosis, 1 ppm CI), 
ad libitum 



CR14, Batch #TBD 
Clear Liquid 

CuraGen Corporation, Branford, CT 
Purity ofBatch#TBD 

Tubes of CR14 should be stored at -70°C until 
ready for use. 

TBD 

No special handling requirements or precautions 
are needed. 

Reserve samples will be retained for confirmation. 

Unused CRM should be stored at 4°C before being 
returned to CuraGen Corporation. 

TBD 

CuraGen Corporation, Branford CT, 06405 

TBD 

TBD 



CR014 in the appropriate vehicle 

TBD 

TBD 

TBD 

TBD 

Unused CR14 should be stored at 4°C before being 
returned to CuraGen Corporation. 

Intravenous (i.v.) injection. 

This is an intended clinical route of administration. 

Every 4 days for a total of 4 doses. 

In mg/kg, see Table 

Adjust per animal weight. 
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Justification for dose levels: To determine the anti-tumor effects of an 

immunoconjugate of CR014 against the 786-0 
human renal carcinoma xenograft. 

Experimental Design 

After an acclimation period, mice are implanted subcutaneously with a total of 5 x 10 6 
786-0 renal cell carcinoma cells in a volume of 0.20 mL of serum-free medium. Animals are 
randomized and individually identified. After tumors reach a volume of 150mm 3 , treatment 
with CR014 will begin. CR014 is administered intravenously (i.v.) at the doses and schedule 
in Table 33. Mice will be observed daily for morbidity and mortality, and the tumors and 
body weight will be recorded twice weekly throughout the study period. The study design is 
shown in Table 33, and the study schedule is shown in Table 34. 



Table 33. Study Design 



Group 
Number 


Number of 
Animals 


Treatment 


Treatment 
Schedule* 


Volume 
(mL) 


1 


10 females 


Vehicle Control, i.v. 


q4dX4 


Based on 
weight 


2 


10 females 


Isotype Control, i.v. 


q4dX4 


Based on 
weight 


3 


10 females 


Paclitaxel 20 mg/kg, i.v.* 


qdX5 


Based on 
weight 


4 


10 females 


10.0mg/kgCR014#l,i.v. 


q4dX4 


Based on 
weight 


5 


10 females 


3.3 mg/kg CR014#1, i.v. 


q4dX4 


Based on 
weight 


6 


10 females 


1.0mg/kgCR014#l,i.v. 


q4dX4 


Based on 
weight 


7 


10 females 


10.0 mg/kg CR014#2, i.v. 


q4dX4 


Based on 
weight 


8 


10 females 


3.3 mg/kg CR014#2, i.v. 


q4dX4 


Based on 
weight 


9 


10 females 


1.0 mg/kg CR014#2, i.v. 


q4dX4 


Based on 
weight 



5 
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*At present, the only FDA-approved agents for renal cell carcinoma are Interferon- 
alpha and Interleukin-2, both of which produce only a 15 % objective response with an 
extremely low (<5 %) rate of durable responses. (NCI Physicians Data Query). 
Consequently, there is no effective positive control for renal cell carcinoma in the 
5 xenograft model. The drug Paclitaxel has replaced the positive control in the study design 
as a reference agent. 



Table 34: Study Schedule 


Event 


Day -7 


DayO 


Day 7 














Receipt of animals 8 


X 


















Tumor Implantation 




X 
















Body weights 




X 
















Scheduled termination 









































Tumor implantation: Tumor cells are harvested from sub-confluent exponentially 
growing cell cultures. Cells are counted and evaluated for viability using trypan blue 
10 before being suspended in serum-free medium. A total of 5 x 10 6 cells in a volume of 0.20 
mL are implanted subcutaneously in the flanks of mice. For this study, matrigel is not used 
to enhance tumor take rates. 

Tumor Measurement and Volume Determination: Tumor growth is measured 
and recorded 3 times a week using a Vernier caliper. Length and width are measured for 
1 5 each tumor. Tumor weight is determined using the following formula: 

Tumor Weight (mg) = w 2 x / 

2 
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Clinical Observations/Signs: Animals are observed daily for significant clinical 
signs, moribidity and mortality. 

Animals Found Dead or Moribund: Percentage of animal mortality and time to 
5 death will be recorded for every group in the study. Mice may be defined as moribund and 
sacrificed if one or more of the following criteria are met: 

1) Loss of body weight of 20% or greater in a 2- week period. 

2) Tumors that inhibit normal physiological function such as eating, drinking, 
mobility and ability to urinate and or defecate. 

10 3) Tumors that exceed a maximum dimension of 2000 mg as measured by calipers. 

4) Ulcerated tumors, or tumors which bleed or produce exudates. 

5) Prolonged diarrhea leading to weight loss. 

6) Persistent wheezing and respiratory distress. 

Animals can also be considered moribund if there is prolonged or excessive pain or 
1 5 distress as defined by clinical observations such as: prostration, hunched posture, 

paralysis/paresis, distended abdomen, ulcerations, abscesses, seizures and/or hemorrhages. 

Statistics (TGI): The One-Way Analysis of Variance (ANOVA) and Mann- 
Whitney U test (analyzing means and medians, respectively) were used to determine the 
statistical significance of any difference between the mean group tumor weights on the day 
20 of %TGI calculations. Dunnett's test was applied to groups with unequal sample size and 
Welch's correction was applied when populations of unequal variance were compared. 
All statistical analyses were conducted at an a level of 0.05 (two-tailed). Prism 
(GraphPad) version 3.0 was used for all statistical analyses and graphic presentations. 
Statistics (TGD): The Logrank test was used to compare differences in overall 
25 survival experience between treated and control groups. Statistical analyses were 

conducted at an a level of 0.05 (two-tailed). Prism (GraphPad) version 3 was used for all 
statistical analyses and graphic presentation. 

1 . Tumor volumes, doubling rate and tumor growth delay. 

2. Tumor growth delay at 500, 1000, 1 500 and 2000 mm 3 . 

30 3. Animal weight charts with percent weight change for treatment groups. 
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Final Report: The final report includes a summary of the methods and the raw data as 
well as results on tumor size, tumor weight, rate of tumor growth/cell line/sex, tumor 
growth delay, animal weights, percentage mortality, time to death and clinical 
observations. 

5 

OTHER EMBODIMENTS 

Although particular embodiments have been disclosed herein in detail, this has 
been done by way of example for purposes of illustration only, and is not intended to be 
limiting with respect to the scope of the appended claims, which follow. In particular, it is 

10 contemplated by the inventors that various substitutions, alterations, and modifications 
may be made to the invention without departing from the spirit and scope of the invention 
as defined by the claims. Other aspects, advantages, and modifications considered to be 
within the scope of the following claims. The claims presented are representative of the 
inventions disclosed herein. Other, unclaimed inventions are also contemplated. 

1 5 Applicants reserve the right to pursue such inventions in later claims. 
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CLAIMS 

What is claimed is: 

1 . An isolated polypeptide comprising the mature form of an amino acid 
sequenced selected from the group consisting of SEQ ID NO:2n, wherein n is an integer 
between 1 and 12. 

2. An isolated polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:2n, wherein n is an integer between I and 12. 

3. An isolated polypeptide comprising an amino acid sequence which is at 
least 95% identical to an amino acid sequence selected from the group consisting of SEQ 
ID NO:2n, wherein n is an integer between 1 and 12. 

4. An isolated polypeptide, wherein the polypeptide comprises an amino acid 
sequence comprising one or more conservative substitutions in the amino acid sequence 
selected from the group consisting of SEQ ID NO:2n, wherein n is an integer between 1 
and 12. 

5. The polypeptide of claim 1 wherein said polypeptide is naturally occurring. 

6. A composition comprising the polypeptide of claim 1 and a carrier. 

7. A kit comprising, in one or more containers, the composition of claim 6. 

8. The use of a therapeutic in the manufacture of a medicament for treating a 
syndrome associated with a human disease, the disease selected from a pathology 
associated with the polypeptide of claim 1 , wherein the therapeutic comprises the 
polypeptide of claim 1 . 
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9. A method for determining the presence or amount of the polypeptide of 
claim 1 in a sample, the method comprising: 

(a) providing said sample; 

(b) introducing said sample to an antibody that binds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 

10. A method for determining the presence of or predisposition to a disease 
associated with altered levels of expression of the polypeptide of claim 1 in a first 
mammalian subject, the method comprising: 

a) measuring the level of expression of the polypeptide in a sample from the 
first mammalian subject; and 

b) comparing the expression of said polypeptide in the sample of step (a) to 
the expression of the polypeptide present in a control sample from a second 
mammalian subject known not to have, or not to be predisposed to, said 
disease, 

wherein an alteration in the level of expression of the polypeptide in the first subject as 
compared to the control sample indicates the presence of or predisposition to said disease. 

11. A method of identifying an agent that binds to the polypeptide of claim 1 , 
the method comprising: 

(a) introducing said polypeptide to said agent; and 

(b) determining whether said agent binds to said polypeptide. 

12. The method of claim 1 1 wherein the agent is a cellular receptor or a 
downstream effector. 

13. A method for identifying a potential therapeutic agent for use in treatment 
of a pathology, wherein the pathology is related to aberrant expression or aberrant 
physiological interactions of the polypeptide of claim 1, the method comprising: 
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(a) providing a cell expressing the polypeptide of claim 1 and having a 
property or function ascribable to the polypeptide; 

(b) contacting the cell with a composition comprising a candidate substance; 
and 

(c) determining whether the substance alters the property or function ascribable 
to the polypeptide; 

whereby, if an alteration observed in the presence of the substance is not observed when 
the cell is contacted with a composition in the absence of the substance, the substance is 
identified as a potential therapeutic agent. 

14. A method for screening for a modulator of activity of or of latency or 
predisposition to a pathology associated with the polypeptide of claim 1, said method 
comprising: 

(a) administering a test compound to a test animal at increased risk for a 
pathology associated with the polypeptide of claim 1, wherein said test 
animal recombinantly expresses the polypeptide of claim 1; 

(b) measuring the activity of said polypeptide in said test animal after 
administering the compound of step (a); and 

(c) comparing the activity of said polypeptide in said test animal with the 
activity of said polypeptide in a control animal not administered said 
polypeptide, wherein a change in the activity of said polypeptide in said test 
animal relative to said control animal indicates the test compound is a 
modulator activity of or latency or predisposition to, a pathology associated 
with the polypeptide of claim 1 . 

15. The method of claim 14, wherein said test animal is a recombinant test 
animal that expresses a test protein transgene or expresses said transgene under the control 
of a promoter at an increased level relative to a wild-type test animal, and wherein said 
promoter is not the native gene promoter of said transgene. 

1 6. A method for modulating the activity of the polypeptide of claim 1 , the 

method comprising contacting a cell sample expressing the polypeptide of claim 1 with a 
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compound that binds to said polypeptide in an amount sufficient to modulate the activity 
of the polypeptide. 

1 7. A method of treating or preventing a pathology associated with the 
polypeptide of claim 1, the method comprising administering the polypeptide of claim 1 to 
a subject in which such treatment or prevention is desired in an amount sufficient to treat 
or prevent the pathology in the subject. 

1 8. The method of claim 1 7, wherein the subject is a human. 

19. A method of treating a pathological state in a mammal, the method 
comprising administering to the mammal a polypeptide in an amount that is sufficient to 
alleviate the pathological state, wherein the polypeptide is a polypeptide having an amino 
acid sequence at least 95% identical to a polypeptide comprising the amino acid sequence 
selected from the group consisting of SEQ ID NO:2n, wherein n is an integer between 1 
and 12 or a biologically active fragment thereof. 

20. An isolated nucleic acid molecule comprising a nucleic acid sequence 
selected from the group consisting of SEQ ID NO:2n-l, wherein n is an integer between 1 
and 12. 

21 . The nucleic acid molecule of claim 20, wherein the nucleic acid molecule is 
naturally occurring. 

22. A nucleic acid molecule, wherein the nucleic acid molecule differs by a 
single nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
ID NO: 2n-l, wherein n is an integer between 1 and 12. 

23 . An isolated nucleic acid molecule encoding the mature form of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO:2n, wherein n is an integer between 1 and 12. 
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24. An isolated nucleic acid molecule comprising a nucleic acid selected from 
the group consisting of 2n-l, wherein n is an integer between 1 and 12. 

25. The nucleic acid molecule of claim 20, wherein said nucleic acid molecule 
hybridizes under stringent conditions to the nucleotide sequence selected from the group 
consisting of SEQ ID NO: 2n-l , wherein n is an integer between 1 and 12, or a 
complement of said nucleotide sequence. 

26. A vector comprising the nucleic acid molecule of claim 20. 

27. The vector of claim 26, further comprising a promoter operably linked to 
said nucleic acid molecule. 

28. A cell comprising the vector of claim 26. 

29. An antibody that immunospecifically binds to the polypeptide of claim 1 . 

30. The antibody of claim 29, wherein the antibody is a monoclonal antibody. 

3 1 . The antibody of claim 29, wherein the antibody is a humanized antibody. 

32. A method for determining the presence or amount of the nucleic acid 
molecule of claim 20 in a sample, the method comprising: 

(a) providing said sample; 

(b) introducing said sample to a probe that binds to said nucleic acid molecule; 
and 

(c) determining the presence or amount of said probe bound to said nucleic 
acid molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said sample. 

33. The method of claim 32 wherein presence or amount of the nucleic acid 

molecule is used as a marker for cell or tissue type. 
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34. The method of claim 33 wherein the cell or tissue type is cancerous. 

35. A method for determining the presence of or predisposition to a disease 
associated with altered levels of expression of the nucleic acid molecule of claim 20 in a 
first mammalian subject, the method comprising: 

a) measuring the level of expression of the nucleic acid in a sample from the 
first mammalian subject; and 

b) comparing the level of expression of said nucleic acid in the sample of step 
(a) to the level of expression of the nucleic acid present in a control sample 
from a second mammalian subject known not to have or not be predisposed 
to, the disease; 

wherein an alteration in the level of expression of the nucleic acid in the first subject as 
compared to the control sample indicates the presence of or predisposition to the disease. 

36. A method of producing the polypeptide of claim 1 , the method comprising 
culturing a cell under conditions that lead to expression of the polypeptide, wherein said 
cell comprises a vector comprising an isolated nucleic acid molecule comprising a nucleic 
acid sequence selected from the group consisting of SEQ ID NO:2n-l, wherein n is an 
integer between 1 and 12. 

37. The method of claim 36 wherein the cell is a bacterial cell. 

38. The method of claim 36 wherein the cell is an insect cell. 

39. The method of claim 36 wherein the cell is a yeast cell. 

40. The method of claim 36 wherein the cell is a mammalian cell. 

41. A method of producing the polypeptide of claim 2, the method comprising 

culturing a cell under conditions that lead to expression of the polypeptide, wherein said 

cell comprises a vector comprising an isolated nucleic acid molecule comprising a nucleic 
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acid sequence selected from the group consisting of SEQ ED NO:2n-l, wherein n is an 
integer between 1 and 12. 

42. The method of claim 41 wherein the cell is a bacterial cell. 

43. The method of claim 41 wherein the cell is an insect cell. 

44. The method of claim 41 wherein the cell is a yeast cell. 

45. The method of claim 41 wherein the cell is a mammalian cell. 

46. The antibody of claim 29, wherein the antibody is conjugated to a cytotoxic 
agent selected from the group consisting of a chemotherapeutic agent, a toxin, and a 
radioactive isotope. 

47. A method for treating renal cancer comprising administering the antibody 
of claim 46 to a patient suffering from renal cancer. 

48. A method for treating inflammation comprising administering the antibody 
of claim 29 to a patient in need thereof. 

49. A purified protein complex comprising a first polypeptide and a second 
polypeptide, wherein said complex comprises the amino acid sequences of a first 
polypeptide (NOV1), and a second polypeptide (NOV2). 

50. An antibody which immunospecifically binds to the protein complex of 
claim 49, which does not immunospecifically bind a NOV1 protein or a NOV2 protein 
which are not part of a NOV1-NOV2 protein complex. 

5 1 . The antibody of claim 50, wherein the binding of the antibody to the 
NOV1-NOV2 protein complex disrupts the interaction between NOV1 and NOV2. 
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52. The antibody of claim 50, wherein the binding of the antibody to the 
NOV1-NOV2 protein complex prevents the formation of the complex. 
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TOXIN-KILUNG: CLONOGENIC ASSAY OF ACHN CELLS 
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TOXIN-KILLING: CLONOGENIC ASSAY OF BTS49 CELLS 
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